<210> 1
<211> 30 
<212> PRT 

<213> Homo sapiens 
<400> 1 

Arg His Arg Leu His Leu Ala Gly Phe Ser Ser Val Arg Tyr Ala Gln
1 5 10 15

Gly Val Asn Glu Asp Glu Ile Met Leu Pro Ser Phe His Asp
20 25 30



<210> 2 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Ser Glu Ser Ser Gly Asn Asp Pro Phe Thr Ile Val His Glu Asn Thr
1 5 10 15

Gly Lys Cys Ile Gln Pro Leu Phe Asp
20 25



<210> 3 
<211> 1723 
<212> PRT 

<213> Mus musculus 



<400> 3 



Met Arg Thr Gly Arg Val Thr Pro Gly Leu Ala Ala Gly Leu Leu Leu
1 5 10 15

Leu Leu Leu Arg Ser Phe Gly Leu Val Glu Pro Ser Glu Ser Ser Gly
20 25 30

Asn Asp Pro Phe Thr Ile Val His Glu Asn Thr Gly Lys Cys Ile Gln
35 40 45

Pro Leu Ser Asp Trp Val Val Ala Gln Asp Cys Ser Gly Thr Asn Asn
50 55 60

Met Leu Trp Lys Trp Val Ser Gln His Arg Leu Phe His Leu Glu Ser
65 70 75 80

Gln Lys Cys Leu Gly Leu Asp Ile Thr Lys Ala Thr Asp Asn Leu Arg
85 90 95

Met Phe Ser Cys Asp Ser Thr Val Met Leu Trp Trp Lys Cys Glu His
100 105 110

His Ser Leu Tyr Thr Ala Ala Gln Tyr Arg Leu Ala Leu Lys Asp Gly
115 120 125

Tyr Ala Val Ala Asn Thr Asn Thr Ser Asp Val Trp Lys Lys Gly Gly
130 135 140

Ser Glu Glu Asn Leu Cys Ala Gln Pro Tyr His Glu Ile Tyr Thr Arg
145 150 155 160

Asp Gly Asn Ser Tyr Gly Arg Pro Cys Glu Phe Pro Phe Leu Ile Gly
165 170 175

Glu Thr Trp Tyr His Asp Cys Ile His Asp Glu Asp His Ser Gly Pro
180 185 190

Trp Cys Ala Thr Thr Leu Ser Tyr Glu Tyr Asp Gln Lys Trp Gly Ile
195 200 205

Cys Leu Leu Pro Glu Ser Gly Cys Glu Gly Asn Trp Glu Lys Asn Glu
210 215 220

Gln Ile Gly Ser Cys Tyr Gln Phe Asn Asn Gln Glu Ile Leu Ser Trp
225 230 235 240

Lys Glu Ala Tyr Val Ser Cys Gln Asn Gln Gly Ala Asp Leu Leu Ser
245 250 255

Ile His Ser Ala Ala Glu Leu Ala Tyr Ile Thr Gly Lys Glu Asp Ile
260 265 270

Ala Arg Leu Val Trp Leu Gly Leu Asn Gln Leu Tyr Ser Ala Arg Gly
275 280 285

Trp Glu Trp Ser Asp Phe Arg Pro Leu Lys Phe Leu Asn Trp Asp Pro
290 295 300

Gly Thr Pro Val Ala Pro Val Ile Gly Gly Ser Ser Cys Ala Arg Met
305 310 315 320



Asp Thr Glu Ser Gly Leu Trp Gln Ser Val Ser Cys Glu Ser Gln Gln
325 330 335

Pro Tyr Val Cys Lys Lys Pro Leu Asn Asn Thr Leu Glu Leu Pro Asp
340 345 350

Val Trp Thr Tyr Thr Asp Thr His Cys His Val Gly Trp Leu Pro Asn
355 360 365

Asn Gly Phe Cys Tyr Leu Leu Ala Asn Glu Ser Ser Ser Trp Asp Ala
370 375 380

Ala His Leu Lys Cys Lys Ala Phe Gly Ala Asp Leu Ile Ser Met His
385 390 395 400

Ser Leu Ala Asp Val Glu Val Val Val Thr Lys Leu His Asn Gly Asp
405 410 415

Val Lys Lys Glu Ile Trp Thr Gly Leu Lys Asn Thr Asn Ser Pro Ala
420 425 430

Leu Phe Gln Trp Ser Asp Gly Thr Glu Val Thr Leu Thr Tyr Trp Asn
435 440 445

Glu Asn Glu Pro Ser Val Pro Phe Asn Lys Thr Pro Asn Cys Val Ser
450 455 460

Tyr Leu Gly Lys Leu Gly Gln Trp Lys Val Gln Ser Cys Glu Lys Lys
465 470 475 480

Leu Arg Tyr Val Cys Lys Lys Lys Gly Glu Ile Thr Lys Asp Ala Glu
485 490 495

Ser Asp Lys Leu Cys Pro Pro Asp Glu Gly Trp Lys Arg His Gly Glu
500 505 510

Thr Cys Tyr Lys Ile Tyr Glu Lys Glu Ala Pro Phe Gly Thr Asn Cys
515 520 525

Asn Leu Thr Ile Thr Ser Arg Phe Glu Gln Glu Phe Leu Asn Tyr Met
530 535 540

Met Lys Asn Tyr Asp Lys Ser Leu Arg Lys Tyr Phe Trp Thr Gly Leu
545 550 555 560

Arg Asp Pro Asp Ser Arg Gly Glu Tyr Ser Trp Ala Val Ala Gln Gly
565 570 575

Val Lys Gln Ala Val Thr Phe Ser Asn Trp Asn Phe Leu Glu Pro Ala
580 585 590

Ser Pro Gly Gly Cys Val Ala Met Ser Thr Gly Lys Thr Leu Gly Lys
595 600 605

Trp Glu Val Lys Asn Cys Arg Ser Phe Arg Ala Leu Ser Ile Cys Lys
610 615 620

Lys Val Ser Glu Pro Gln Glu Pro Glu Glu Ala Ala Pro Lys Pro Asp
625 630 635 640

Asp Pro Cys Pro Glu Gly Trp His Thr Phe Pro Ser Ser Leu Ser Cys
645 650 655



Tyr Lys Val Phe His Ile Glu Arg Ile Val Arg Lys Arg Asn Trp Glu
660 665 670

Glu Ala Glu Arg Phe Cys Gln Ala Leu Gly Ala His Leu Pro Ser Phe
675 680 685

Ser Arg Arg Glu Glu Ile Lys Asp Phe Val His Leu Leu Lys Asp Gln
690 695 700

Phe Ser Gly Gln Arg Trp Leu Trp Ile Gly Leu Asn Lys Arg Ser Pro
705 710 715 720

Asp Leu Gln Gly Ser Trp Gln Trp Ser Asp Arg Thr Pro Val Ser Ala
725 730 735

Val Met Met Glu Pro Glu Phe Gln Gln Asp Phe Asp Ile Arg Asp Cys
740 745 750

Ala Ala Ile Lys Val Leu Asp Val Pro Trp Arg Arg Val Trp His Leu
755 760 765

Tyr Glu Asp Lys Asp Tyr Ala Tyr Trp Lys Pro Phe Ala Cys Asp Ala
770 775 780

Lys Leu Glu Trp Val Cys Gln Ile Pro Lys Gly Ser Thr Pro Gln Met
785 790 795 800

Pro Asp Trp Tyr Asn Pro Glu Arg Thr Gly Ile His Gly Pro Pro Val
805 810 815

Ile Ile Glu Gly Ser Glu Tyr Trp Phe Val Ala Asp Pro His Leu Asn
820 825 830

Tyr Glu Glu Ala Val Leu Tyr Cys Ala Ser Asn His Ser Phe Leu Ala
835 840 845

Thr Ile Thr Ser Phe Thr Gly Leu Lys Ala Ile Lys Asn Lys Leu Ala
850 855 860

Asn Ile Ser Gly Glu Glu Gln Lys Trp Trp Val Lys Thr Ser Glu Asn
865 870 875 880

Pro Ile Asp Arg Tyr Phe Leu Gly Ser Arg Arg Arg Leu Trp His His
885 890 895

Phe Pro Met Thr Phe Gly Asp Glu Cys Leu His Met Ser Ala Lys Thr
900 905 910

Trp Leu Val Asp Leu Ser Lys Arg Ala Asp Cys Asn Ala Lys Leu Pro
915 920 925

Phe Ile Cys Glu Arg Tyr Asn Val Ser Ser Leu Glu Lys Tyr Ser Pro
930 935 940

Asp Pro Ala Ala Lys Val Gln Cys Thr Glu Lys Trp Ile Pro Phe Gln
945 950 955 960

Asn Lys Cys Phe Leu Lys Val Asn Ser Gly Pro Val Thr Phe Ser Gln
965 970 975

Ala Ser Gly Ile Cys His Ser Tyr Gly Gly Thr Leu Pro Ser Val Leu
980 985 990



Ser Arg Gly Glu Gln Asp Phe Ile Ile Ser Leu Leu Pro Glu Met Glu
995 1000 1005

Ala Ser Leu Trp Ile Gly Leu Arg Trp Thr Ala Tyr Glu Arg Ile Asn
1010 1015 1020

Arg Trp Thr Asp Asn Arg Glu Leu Thr Tyr Ser Asn Phe His Pro Leu
1025 1030 1035 1040

Leu Val Gly Arg Arg Leu Ser Ile Pro Thr Asn Phe Phe Asp Asp Glu
1045 1050 1055

Ser His Phe His Cys Ala Leu Ile Leu Asn Leu Lys Lys Ser Pro Leu
1060 1065 1070

Thr Gly Thr Trp Asn Phe Thr Ser Cys Ser Glu Arg His Ser Leu Ser
1075 1080 1085

Leu Cys Gln Lys Tyr Ser Glu Thr Glu Asp Gly Gln Pro Trp Glu Asn
1090 1095 1100

Thr Ser Lys Thr Val Lys Tyr Leu Asn Asn Leu Tyr Lys Ile Ile Ser
1105 1110 1115 1120

Lys Pro Leu Thr Trp His Gly Ala Leu Lys Glu Cys Met Lys Glu Lys
1125 1130 1135

Met Arg Leu Val Ser Ile Thr Asp Pro Tyr Gln Gln Ala Phe Leu Ala
1140 1145 1150

Val Gln Ala Thr Leu Arg Asn Ser Ser Phe Trp Ile Gly Leu Ser Ser
1155 1160 1165

Gln Asp Asp Glu Leu Asn Phe Gly Trp Ser Asp Gly Lys Arg Leu Gln
1170 1175 1180

Phe Ser Asn Trp Ala Gly Ser Asn Glu Gln Leu Asp Asp Cys Val Ile
1185 1190 1195 1200

Leu Asp Thr Asp Gly Phe Trp Lys Thr Ala Asp Cys Asp Asp Asn Gln
1205 1210 1215

Pro Gly Ala Ile Cys Tyr Tyr Pro Gly Asn Glu Thr Glu Glu Glu Val
1220 1225 1230



Arg Ala Leu Asp Thr Ala Lys Cys Pro Ser Pro Val Gln Ser Thr Pro
1235 1240 1245

Trp Ile Pro Phe Gln Asn Ser Cys Tyr Asn Phe Met Ile Thr Asn Asn
1250 1255 1260

Arg His Lys Thr Val Thr Pro Glu Glu Val Gln Ser Thr Cys Glu Lys
1265 1270 1275 1280



Leu His Pro Lys Ala His Ser Leu Ser Ile Arg Asn Glu Glu Glu Asn
1285 1290 1295

Thr Phe Val Val Glu Gln Leu Leu Tyr Phe Asn Tyr Ile Ala Ser Trp
1300 1305 1310

Val Met Leu Gly Ile Thr Tyr Glu Asn Asn Ser Leu Met Trp Phe Asp
1315 1320 1325

Lys Thr Ala Leu Ser Tyr Thr His Trp Arg Thr Gly Arg Pro Thr Val
1330 1335 1340

Lys Asn Gly Lys Phe Leu Ala Gly Leu Ser Thr Asp Gly Phe Trp Asp
1345 1350 1355 1360

Ile Gln Ser Phe Asn Val Ile Glu Glu Thr Leu His Phe Tyr Gln His
1365 1370 1375

Ser Ile Ser Ala Cys Lys Ile Glu Met Val Asp Tyr Glu Asp Lys His
1380 1385 1390

Asn Gly Thr Leu Pro Gln Phe Ile Pro Tyr Lys Asp Gly Val Tyr Ser
1395 1400 1405

Val Ile Gln Lys Lys Val Thr Trp Tyr Glu Ala Leu Asn Ala Cys Ser
1410 1415 1420

Gln Ser Gly Gly Glu Leu Ala Ser Val His Asn Pro Asn Gly Lys Leu
1425 1430 1435 1440

Phe Leu Glu Asp Ile Val Asn Arg Asp Gly Phe Pro Leu Trp Val Gly
1445 1450 1455

Leu Ser Ser His Asp Gly Ser Glu Ser Ser Phe Glu Trp Ser Asp Gly
1460 1465 1470



Arg Ala Phe Asp Tyr Val Pro Trp Gln Ser Leu Gln Ser Pro Gly Asp
1475 1480 1485

Cys Val Val Leu Tyr Pro Lys Gly Ile Trp Arg Arg Glu Lys Cys Leu
1490 1495 1500

Ser Val Lys Asp Gly Ala Ile Cys Tyr Lys Pro Thr Lys Asp Lys Lys
1505 1510 1515 1520

Leu Ile Phe His Val Lys Ser Ser Lys Cys Pro Val Ala Lys Arg Asp
1525 1530 1535



Gly Pro Gln Trp Val Gln Tyr Gly Gly His Cys Tyr Ala Ser Asp Gln
1540 1545 1550

Val Leu His Ser Phe Ser Glu Ala Lys Gln Val Cys Gln Glu Leu Asp
1555 1560 1565

His Ser Ala Thr Val Val Thr Ile Ala Asp Glu Asn Glu Asn Lys Phe
1570 1575 1580

Val Ser Arg Leu Met Arg Glu Asn Tyr Asn Ile Thr Met Arg Val Trp
1585 1590 1595 1600

Leu Gly Leu Ser Gln His Ser Leu Asp Gln Ser Trp Ser Trp Leu Asp
1605 1610 1615

Gly Leu Asp Val Thr Phe Val Lys Trp Glu Asn Lys Thr Lys Asp Gly
1620 1625 1630

Asp Gly Lys Cys Ser Ile Leu Ile Ala Ser Asn Glu Thr Trp Arg Lys
1635 1640 1645

Val His Cys Ser Arg Gly Tyr Ala Arg Ala Val Cys Lys Ile Pro Leu
1650 1655 1660

Ser Pro Asp Tyr Thr Gly Ile Ala Ile Leu Phe Ala Val Leu Cys Leu
1665 1670 1675 1680

Leu Gly Leu Ile Ser Leu Ala Ile Trp Phe Leu Leu Gln Arg Ser His
1685 1690 1695

Ile Arg Trp Thr Gly Phe Ser Ser Val Arg Tyr Glu His Gly Thr Asn
1700 1705 1710

Glu Asp Glu Val Met Leu Pro Ser Phe His Asp
1715 1720



<210> 4 

<211> 30 

<212> PRT 

<213> Mus musculus 

<400> 4 

Arg Ser His Ile Arg Trp Thr Gly Phe Ser Ser Val Arg Tyr Glu His
1 5 10 15 

Gly Thr Asn Glu Asp Glu Val Met Leu Pro Ser Phe His Asp 
20 25 30 



<210> 5 

<211> 5477 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gaattccggg ggcgggagcc gcgtgcgccc gaggacccgg ccggaaggct tgcgccagct 60
caggatgagg acaggctggg cgacccctcg ccgcccggcg gggctcctca tgctgctctt 120
ctggttcttc gatctcgcgg agccctctgg ccgcgcagct aatgacccct tcaccatcgt 180
ccatggaaat acgggcaagt gcatcaagcc agtgtatggc tggatagtag cagacgactg 240
tgatgaaact gaggacaagt tatggaagtg ggtgtcccag catcggctct ttcatttgca 300
ctcccaaaag tgccttggcc tcgatattac caaatcggta aatgagctga gaatgttcag 360
ctgtgactcc agtgccatgc tgtggtggaa atgtgagcac cactctctgt acggagctgc 420
ccggtaccgg ctggctctga aggatggaca tggcacagca atctcaaatg catctgatgt 480
ctggaagaaa ggaggctcag aggaaagcct ttgtgaccag ccttatcatg agatctatac 540
cagagatggg aactcttatg ggagaccttg tgaatttcca ttcttaattg atgggacctg 600
gcatcatgat tgcattcttg atgaagatca tagtgggcca tggtgtgcca ccaccttaaa 660
ttatgaatat gaccgaaagt ggggcatctg cttaaagcct gaaaacggtt gtgaagataa 720
ttgggaaaag aacgagcagt ttggaagttg ctaccaattt aatactcaga cggctctttc 780
ttggaaagaa gcttatgttt catgtcagaa tcaaggagct gatttactga gcatcaacag 840
tgctgctgaa ttaacttacc ttaaagataa agaaggcatt gctaagattt tctggattgg 900
tttaaatcag ctatactctg ctagaggctg ggaatggtca gaccacaaac cattaaactt 960
tctcaactgg gatccagaca ggcccagtgc acctactata ggtggctcca gctgtgcaag 1020
aatggatgct gagtctggtc tgtggcagag cttttcctgt gaagctcaac tgccctatgt 1080
ctgcaggaaa ccattaaata atacagtgga gttaacagat gtctggacat actcagatac 1140
ccgctgtgat gcaggctggc tgccaaataa tggattttgc tatctgctgg taaatgaaag 1200
taattcctgg gataaggcac atgcgaaatg caaagccttc agtagtgacc taatcagcat 1260
tcattctcta gcagatgtgg aggtggttgt cacaaaactc cataatgagg atatcaaaga 1320
agaagtgtgg ataggcctta agaacataaa cataccaact ttatttcagt ggtcagatgg 1380
tactgaagtt actctaacat attgggatga gaatgagcca aatgttccct acaataagac 1440
gcccaactgt gtttcctact taggagagct aggtcagtgg aaagtccaat catgtgagga 1500
gaaactaaaa tatgtatgca agagaaaggg agaaaaactg aatgacgcaa gttctgataa 1560
gatgtgtcct ccagatgagg gctggaagag acatggagaa acctgttaca agatttatga 1620
ggatgaggtc ccttttggaa caaactgcaa tctgactatc actagcagat ttgagcaaga 1680
atacctaaat gatttgatga aaaagtatga taaatctcta agaaaatact tctggactgg 1740
cctgagagat gtagattctt gtggagagta taactgggca actgttggtg gaagaaggcg 1800
ggctgtaacc ttttccaact ggaattttct tgagccagct tccccgggcg gctgcgtggc 1860
tatgtctact ggaaagtctg ttggaaagtg ggaggtgaag gactgcagaa gcttcaaagc 1920
actttcaatt tgcaagaaaa tgagtggacc ccttgggcct gaagaagcat cccctaagcc 1980
tgatgacccc tgtcctgaag gctggcagag tttccccgca agtctttctt gttataaggt 2040



attccatgca gaaagaattg taagaaagag gaactgggaa gaagctgaac gattctgcca 2100
agcccttgga gcacaccttt ctagcttcag ccatgtggat gaaataaagg aatttcttca 2160
ctttttaacg gaccagttca gtggccagca ttggctgtgg attggtttga ataaaaggag 2220
cccagattta caaggatcct ggcaatggag tgatcgtaca ccagtgtcta ctattatcat 2280
gccaaatgag tttcagcagg attatgacat cagagactgt gctgctgtca aggtatttca 2340
taggccatgg cgaagaggct ggcatttcta tgatgataga gaatttattt atttgaggcc 2400
ttttgcttgt gatacaaaac ttgaatgggt gtgccaaatt ccaaaaggcc gtactccaaa 2460
aacaccagac tggtacaatc cagagcgtgc tggaattcat ggacctccac ttataattga 2520
aggaagtgaa tattggtttg ttgctgatct tcacctaaac tatgaagaag ccgtcctgta 2580
ctgtgccagc aatcacagct ttcttgcaac tataacatct tttgtgggac taaaagccat 2640
caaaaacaaa atagcaaata tatctggtga tggacagaag tggtggataa gaattagcga 2700
gtggccaata gatgatcatt ttacatactc acgatatcca tggcaccgct ttcctgtgac 2760
atttggagag gaatgcttgt acatgtctgc caagacttgg cttatcgact taggtaaacc 2820
aacagactgt agtaccaagt tgcccttcat ctgtgaaaaa tataatgttt cttcgttaga 2880
gaaatacagc ccagattctg cagctaaagt gcaatgttct gagcaatgga ttccttttca 2940
gaataagtgt tttctaaaga tcaaacccgt gtctctcaca ttttctcaag caagcgatac 3000
ctgtcactcc tatggtggca cccttccttc agtgttgagc cagattgaac aagactttat 3060
tacatccttg cttccggata tggaagctac tttatggatt ggtttgcgct ggactgccta 3120
tgaaaagata aacaaatgga cagataacag agagctgacg tacagtaact ttcacccatt 3180
attggttagt gggaggctga gaataccaga aaattttttt gaggaagagt ctcgctacca 3240
ctgtgcccta atactcaacc tccaaaaatc accgtttact gggacgtgga attttacatc 3300
ctgcagtgaa cgccactttg tgtctctctg tcagaaatat tcagaagtta aaagcagaca 3360
gacgttgcag aatgcttcag aaactgtaaa gtatctaaat aatctgtaca aaataatccc 3420
aaagactctg acttggcaca gtgctaaaag ggagtgtctg aaaagtaaca tgcagctggt 3480
gagcatcacg gacccttacc agcaggcatt cctcagtgtg caggcgctcc ttcacaactc 3540
ttccttatgg atcggactct tcagtcaaga tgatgaactc aactttggtt ggtcagatgg 3600
gaaacgtctt cattttagtc gctgggctga aactaatggg caactcgaag actgtgtagt 3660
attagacact gatggattct ggaaaacagt tgattgcaat gacaatcaac caggtgctat 3720
ttgctactat ccaggaaatg agactgaaaa agaggtcaaa ccagttgaca gtgttaaatg 3780
tccatctcct gttctaaata ctccgtggat accatttcag aactgttgct acaatttcat 3840
aataacaaag aataggcata tggcaacaac acaggatgaa gttcatacta aatgccagaa 3900
actgaatcca aaatcacata ttctgagtat tcgagatgaa aaggagaata actttgttct 3960
tgagcaactg ctgtacttca attatatggc ttcatgggtc atgttaggaa taacttatag 4020
aaataattct cttatgtggt ttgataagac cccactgtca tatacacatt ggagagcagg 4080
aagaccaact ataaaaaatg agaggttttt ggctggttta agtactgacg gcttctggga 4140
tattcaaacc tttaaagtta ttgaagaagc agtttatttt caccagcaca gcattcttgc 4200
ttgtaaaatt gaaatggttg actacaaaga agaatataat actacactgc cacagtttat 4260
gccatatgaa gatggtattt acagtgttat tcaaaaaaag gtaacatggt atgaagcatt 4320
aaacatgtgt tctcaaagtg gaggtcactt ggcaagcgtt cacaaccaaa atggccagct 4380
ctttctggaa gatattgtaa aacgtgatgg atttccacta tgggttgggc tctcaagtca 4440
tgatggaagt gaatcaagtt ttgaatggtc tgatggtagt acatttgact atatcccatg 4500
gaaaggccaa acatctcctg gaaattgtgt tctcttggat ccaaaaggaa cttggaaaca 4560
tgaaaaatgc aactctgtta aggatggtgc tatttgttat aaacctacaa aagctaaaaa 4620
gctgtcccgt cttacatatt catcaagatg tccagcagca aaagagaatg ggtcacggtg 4680
gatccagtac aagggtcact gttacaagtc tgatcaggca ttgcacagtt tttcagaggc 4740
caaaaaattg tgttcaaaac atgatcactc tgcaactatc gtttccataa aagatgaaga 4800
tgagaataaa tttgtgagca gactgatgag ggaaaataat aacattacca tgagagtttg 4860
gcttggatta tctcaacatt ctgttgacca gtcttggagt tggttagatg gatcagaagt 4920
gacatttgtc aaatgggaaa ataaaagtaa gagtggtgtt ggaagatgta gcatgttgat 4980
agcttcaaat gaaacttgga aaaaagttga atgtgaacat ggttttggaa gagttgtctg 5040
caaagtgcct ctgggccctg attacacagc aatagctatc atagttgcca cactaagtat 5100
cttagttctc atgggcggac tgatttggtt cctcttccaa aggcaccgtt tgcacctggc 5160
gggtttctca tcagttcgat atgcacaagg agtgaatgaa gatgagatta tgcttccttc 5220
tttccatgac taaattcttc taaaagtttt ctaatttgca ctaatgtgtt atgagaaatt 5280
agtcacttaa aatgtccagt gtcagtattt actctgctcc aaagtagaac tcttaaatac 5340
tttttcagtt gtttagatct aggcatgtgc tggtatccac agttaattcc ctgctaaatg 5400
ccatgtttat caccctaatt aatagaatgg aggggactcc aaagctggaa ctgaagtcaa 5460
attgtttgac agtaata 5477



<210> 6 
<211> 1825 
<212> PRT 



<213> Homo sapiens 



<220> 

<221> MOD_RES
<222> (1744) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (1751) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (1763) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (1785) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (1787) 

<223> Variable amino acid 
<220> 

<221> MOD_RES
<222> (1795) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (1807)..(1808)

<223> Variable amino acid 

<400> 6 

Asn Ser Gly Gly Gly Ser Arg Val Arg Pro Arg Thr Arg Pro Glu Gly
1 5 10 15

Leu Arg Gln Leu Arg Met Arg Thr Gly Trp Ala Thr Pro Arg Arg Pro
20 25 30

Ala Gly Leu Leu Met Leu Leu Phe Trp Phe Phe Asp Leu Ala Glu Pro
35 40 45

Ser Gly Arg Ala Ala Asn Asp Pro Phe Thr Ile Val His Gly Asn Thr
50 55 60

Gly Lys Cys Ile Lys Pro Val Tyr Gly Trp Ile Val Ala Asp Asp Cys
65 70 75 80

Asp Glu Thr Glu Asp Lys Leu Trp Lys Trp Val Ser Gln His Arg Leu
85 90 95

Phe His Leu His Ser Gln Lys Cys Leu Gly Leu Asp Ile Thr Lys Ser
100 105 110

Val Asn Glu Leu Arg Met Phe Ser Cys Asp Ser Ser Ala Met Leu Trp
115 120 125



Trp Lys Cys Glu His His Ser Leu Tyr Gly Ala Ala Arg Tyr Arg Leu
130 135 140

Ala Leu Lys Asp Gly His Gly Thr Ala Ile Ser Asn Ala Ser Asp Val
145 150 155 160

Trp Lys Lys Gly Gly Ser Glu Glu Ser Leu Cys Asp Gln Pro Tyr His
165 170 175

Glu Ile Tyr Thr Arg Asp Gly Asn Ser Tyr Gly Arg Pro Cys Glu Phe
180 185 190

Pro Phe Leu Ile Asp Gly Thr Trp His His Asp Cys Ile Leu Asp Glu
195 200 205

Asp His Ser Gly Pro Trp Cys Ala Thr Thr Leu Asn Tyr Glu Tyr Asp
210 215 220

Arg Lys Trp Gly Ile Cys Leu Lys Pro Glu Asn Gly Cys Glu Asp Asn
225 230 235 240

Trp Glu Lys Asn Glu Gln Phe Gly Ser Cys Tyr Gln Phe Asn Thr Gln
245 250 255

Thr Ala Leu Ser Trp Lys Glu Ala Tyr Val Ser Cys Gln Asn Gln Gly
260 265 270

Ala Asp Leu Leu Ser Ile Asn Ser Ala Ala Glu Leu Thr Tyr Leu Lys
275 280 285

Asp Lys Glu Gly Ile Ala Lys Ile Phe Trp Ile Gly Leu Asn Gln Leu
290 295 300

Tyr Ser Ala Arg Gly Trp Glu Trp Ser Asp His Lys Pro Leu Asn Phe
305 310 315 320

Leu Asn Trp Asp Pro Asp Arg Pro Ser Ala Pro Thr Ile Gly Gly Ser
325 330 335

Ser Cys Ala Arg Met Asp Ala Glu Ser Gly Leu Trp Gln Ser Phe Ser
340 345 350

Cys Glu Ala Gln Leu Pro Tyr Val Cys Arg Lys Pro Leu Asn Asn Thr
355 360 365

Val Glu Leu Thr Asp Val Trp Thr Tyr Ser Asp Thr Arg Cys Asp Ala
370 375 380

Gly Trp Leu Pro Asn Asn Gly Phe Cys Tyr Leu Leu Val Asn Glu Ser
385 390 395 400

Asn Ser Trp Asp Lys Ala His Ala Lys Cys Lys Ala Phe Ser Ser Asp
405 410 415

Leu Ile Ser Ile His Ser Leu Ala Asp Val Glu Val Val Val Thr Lys
420 425 430

Leu His Asn Glu Asp Ile Lys Glu Glu Val Trp Ile Gly Leu Lys Asn
435 440 445

Ile Asn Ile Pro Thr Leu Phe Gln Trp Ser Asp Gly Thr Glu Val Thr
450 455 460



Leu Thr Tyr Trp Asp Glu Asn Glu Pro Asn Val Pro Tyr Asn Lys Thr
465 470 475 480

Pro Asn Cys Val Ser Tyr Leu Gly Glu Leu Gly Gln Trp Lys Val Gln
485 490 495

Ser Cys Glu Glu Lys Leu Lys Tyr Val Cys Lys Arg Lys Gly Glu Lys
500 505 510

Leu Asn Asp Ala Ser Ser Asp Lys Met Cys Pro Pro Asp Glu Gly Trp
515 520 525

Lys Arg His Gly Glu Thr Cys Tyr Lys Ile Tyr Glu Asp Glu Val Pro
530 535 540

Phe Gly Thr Asn Cys Asn Leu Thr Ile Thr Ser Arg Phe Glu Gln Glu
545 550 555 560

Tyr Leu Asn Asp Leu Met Lys Lys Tyr Asp Lys Ser Leu Arg Lys Tyr
565 570 575

Phe Trp Thr Gly Leu Arg Asp Val Asp Ser Cys Gly Glu Tyr Asn Trp
580 585 590

Ala Thr Val Gly Gly Arg Arg Arg Ala Val Thr Phe Ser Asn Trp Asn
595 600 605

Phe Leu Glu Pro Ala Ser Pro Gly Gly Cys Val Ala Met Ser Thr Gly
610 615 620

Lys Ser Val Gly Lys Trp Glu Val Lys Asp Cys Arg Ser Phe Lys Ala
625 630 635 640

Leu Ser Ile Cys Lys Lys Met Ser Gly Pro Leu Gly Pro Glu Glu Ala
645 650 655

Ser Pro Lys Pro Asp Asp Pro Cys Pro Glu Gly Trp Gln Ser Phe Pro
660 665 670

Ala Ser Leu Ser Cys Tyr Lys Val Phe His Ala Glu Arg Ile Val Arg
675 680 685

Lys Arg Asn Trp Glu Glu Ala Glu Arg Phe Cys Gln Ala Leu Gly Ala
690 695 700

His Leu Ser Ser Phe Ser His Val Asp Glu Ile Lys Glu Phe Leu His
705 710 715 720

Phe Leu Thr Asp Gln Phe Ser Gly Gln His Trp Leu Trp Ile Gly Leu
725 730 735

Asn Lys Arg Ser Pro Asp Leu Gln Gly Ser Trp Gln Trp Ser Asp Arg
740 745 750

Thr Pro Val Ser Thr Ile Ile Met Pro Asn Glu Phe Gln Gln Asp Tyr
755 760 765

Asp Ile Arg Asp Cys Ala Ala Val Lys Val Phe His Arg Pro Trp Arg
770 775 780

Arg Gly Trp His Phe Tyr Asp Asp Arg Glu Phe Ile Tyr Leu Arg Pro
785 790 795 800

Phe Ala Cys Asp Thr Lys Leu Glu Trp Val Cys Gln Ile Pro Lys Gly
805 810 815

Arg Thr Pro Lys Thr Pro Asp Trp Tyr Asn Pro Glu Arg Ala Gly Ile
820 825 830

His Gly Pro Pro Leu Ile Ile Glu Gly Ser Glu Tyr Trp Phe Val Ala
835 840 845

Asp Leu His Leu Asn Tyr Glu Glu Ala Val Leu Tyr Cys Ala Ser Asn
850 855 860

His Ser Phe Leu Ala Thr Ile Thr Ser Phe Val Gly Leu Lys Ala Ile
865 870 875 880

Lys Asn Lys Ile Ala Asn Ile Ser Gly Asp Gly Gln Lys Trp Trp Ile
885 890 895

Arg Ile Ser Glu Trp Pro Ile Asp Asp His Phe Thr Tyr Ser Arg Tyr
900 905 910

Pro Trp His Arg Phe Pro Val Thr Phe Gly Glu Glu Cys Leu Tyr Met
915 920 925

Ser Ala Lys Thr Trp Leu Ile Asp Leu Gly Lys Pro Thr Asp Cys Ser
930 935 940

Thr Lys Leu Pro Phe Ile Cys Glu Lys Tyr Asn Val Ser Ser Leu Glu
945 950 955 960

Lys Tyr Ser Pro Asp Ser Ala Ala Lys Val Gln Cys Ser Glu Gln Trp
965 970 975

Ile Pro Phe Gln Asn Lys Cys Phe Leu Lys Ile Lys Pro Val Ser Leu
980 985 990

Thr Phe Ser Gln Ala Ser Asp Thr Cys His Ser Tyr Gly Gly Thr Leu
995 1000 1005

Pro Ser Val Leu Ser Gln Ile Glu Gln Asp Phe Ile Thr Ser Leu Leu
1010 1015 1020

Pro Asp Met Glu Ala Thr Leu Trp Ile Gly Leu Arg Trp Thr Ala Tyr
1025 1030 1035 1040

Glu Lys Ile Asn Lys Trp Thr Asp Asn Arg Glu Leu Thr Tyr Ser Asn
1045 1050 1055

Phe His Pro Leu Leu Val Ser Gly Arg Leu Arg Ile Pro Glu Asn Phe
1060 1065 1070

Phe Glu Glu Glu Ser Arg Tyr His Cys Ala Leu Ile Leu Asn Leu Gln
1075 1080 1085

Lys Ser Pro Phe Thr Gly Thr Trp Asn Phe Thr Ser Cys Ser Glu Arg
1090 1095 1100

His Phe Val Ser Leu Cys Gln Lys Tyr Ser Glu Val Lys Ser Arg Gln
1105 1110 1115 1120

Thr Leu Gln Asn Ala Ser Glu Thr Val Lys Tyr Leu Asn Asn Leu Tyr
1125 1130 1135



Lys Ile Ile Pro Lys Thr Leu Thr Trp His Ser Ala Lys Arg Glu Cys
1140 1145 1150

Leu Lys Ser Asn Met Gln Leu Val Ser Ile Thr Asp Pro Tyr Gln Gln
1155 1160 1165

Ala Phe Leu Ser Val Gln Ala Leu Leu His Asn Ser Ser Leu Trp Ile
1170 1175 1180

Gly Leu Phe Ser Gln Asp Asp Glu Leu Asn Phe Gly Trp Ser Asp Gly
1185 1190 1195 1200

Lys Arg Leu His Phe Ser Arg Trp Ala Glu Thr Asn Gly Gln Leu Glu
1205 1210 1215

Asp Cys Val Val Leu Asp Thr Asp Gly Phe Trp Lys Thr Val Asp Cys
1220 1225 1230

Asn Asp Asn Gln Pro Gly Ala Ile Cys Tyr Tyr Pro Gly Asn Glu Thr
1235 1240 1245

Glu Lys Glu Val Lys Pro Val Asp Ser Val Lys Cys Pro Ser Pro Val
1250 1255 1260

Leu Asn Thr Pro Trp Ile Pro Phe Gln Asn Cys Cys Tyr Asn Phe Ile
1265 1270 1275 1280

Ile Thr Lys Asn Arg His Met Ala Thr Thr Gln Asp Glu Val His Thr
1285 1290 1295

Lys Cys Gln Lys Leu Asn Pro Lys Ser His Ile Leu Ser Ile Arg Asp
1300 1305 1310

Glu Lys Glu Asn Asn Phe Val Leu Glu Gln Leu Leu Tyr Phe Asn Tyr
1315 1320 1325

Met Ala Ser Trp Val Met Leu Gly Ile Thr Tyr Arg Asn Asn Ser Leu
1330 1335 1340

Met Trp Phe Asp Lys Thr Pro Leu Ser Tyr Thr His Trp Arg Ala Gly
1345 1350 1355 1360

Arg Pro Thr Ile Lys Asn Glu Arg Phe Leu Ala Gly Leu Ser Thr Asp
1365 1370 1375

Gly Phe Trp Asp Ile Gln Thr Phe Lys Val Ile Glu Glu Ala Val Tyr
1380 1385 1390

Phe His Gln His Ser Ile Leu Ala Cys Lys Ile Glu Met Val Asp Tyr
1395 1400 1405

Lys Glu Glu Tyr Asn Thr Thr Leu Pro Gln Phe Met Pro Tyr Glu Asp
1410 1415 1420

Gly Ile Tyr Ser Val Ile Gln Lys Lys Val Thr Trp Tyr Glu Ala Leu
1425 1430 1435 1440

Asn Met Cys Ser Gln Ser Gly Gly His Leu Ala Ser Val His Asn Gln
1445 1450 1455

Asn Gly Gln Leu Phe Leu Glu Asp Ile Val Lys Arg Asp Gly Phe Pro
1460 1465 1470



Leu Trp Val Gly Leu Ser Ser His Asp Gly Ser Glu Ser Ser Phe Glu
1475 1480 1485

Trp Ser Asp Gly Ser Thr Phe Asp Tyr Ile Pro Trp Lys Gly Gln Thr
1490 1495 1500

Ser Pro Gly Asn Cys Val Leu Leu Asp Pro Lys Gly Thr Trp Lys His
1505 1510 1515 1520

Glu Lys Cys Asn Ser Val Lys Asp Gly Ala Ile Cys Tyr Lys Pro Thr
1525 1530 1535

Lys Ala Lys Lys Leu Ser Arg Leu Thr Tyr Ser Ser Arg Cys Pro Ala
1540 1545 1550

Ala Lys Glu Asn Gly Ser Arg Trp Ile Gln Tyr Lys Gly His Cys Tyr
1555 1560 1565

Lys Ser Asp Gln Ala Leu His Ser Phe Ser Glu Ala Lys Lys Leu Cys
1570 1575 1580

Ser Lys His Asp His Ser Ala Thr Ile Val Ser Ile Lys Asp Glu Asp
1585 1590 1595 1600

Glu Asn Lys Phe Val Ser Arg Leu Met Arg Glu Asn Asn Asn Ile Thr
1605 1610 1615

Met Arg Val Trp Leu Gly Leu Ser Gln His Ser Val Asp Gln Ser Trp
1620 1625 1630



Ser Trp Leu Asp Gly Ser Glu Val Thr Phe Val Lys Trp Glu Asn Lys
1635 1640 1645

Ser Lys Ser Gly Val Gly Arg Cys Ser Met Leu Ile Ala Ser Asn Glu
1650 1655 1660

Thr Trp Lys Lys Val Glu Cys Glu His Gly Phe Gly Arg Val Val Cys
1665 1670 1675 1680

Lys Val Pro Leu Gly Pro Asp Tyr Thr Ala Ile Ala Ile Ile Val Ala
1685 1690 1695



Thr Leu Ser Ile Leu Val Leu Met Gly Gly Leu Ile Trp Phe Leu Phe
1700 1705 1710

Gln Arg His Arg Leu His Leu Ala Gly Phe Ser Ser Val Arg Tyr Ala
1715 1720 1725

Gln Gly Val Asn Glu Asp Glu Ile Met Leu Pro Ser Phe His Asp Xaa
1730 1735 1740

Ile Leu Leu Lys Val Phe Xaa Phe Ala Leu Met Cys Tyr Glu Lys Leu
1745 1750 1755 1760

Val Thr Xaa Asn Val Gln Cys Gln Tyr Leu Leu Cys Ser Lys Val Glu
1765 1770 1775

Leu Leu Asn Thr Phe Ser Val Val Xaa Ile Xaa Ala Cys Ala Gly Ile
1780 1785 1790

His Ser Xaa Phe Pro Ala Lys Cys His Val Tyr His Pro Asn Xaa Xaa
1795 1800 1805

Asn Gly Gly Asp Ser Lys Ala Gly Thr Glu Val Lys Leu Phe Asp Ser
1810 1815 1820

Asn
1825



<210> 7 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 7 

tcgaagttgt ccttactcac aatcactcta tagcgccggc gatttgata 49 



<210> 8 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 8 

tcttctcaga gagggtgaga ggaccatttc gatcgatcac tcgccggcga tttgata 57 



<210> 9 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 9 

ctagcgacat ggccaagaag gagacagtct ggaggctcga ggagttcggt aggttcacaa 60 
acaggaac 68 



<210> 10 
<211> 71 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 10 

acagacggta gcacagacta tggtattctc cagattaaca gcaggtatta tgacggtagg 60 
acatgatagg c 71 



<210> 11 
<211> 70 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 11 

gtctgtgttc ctgtttgtga acctaccgaa ctcctcgagc ctccagactg tctccttctt 60 
ggccatgtcg 70 



<210> 12 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 12 

ggccgcctat catgtcctac cgtcataata cctgctgtta atctggagaa taccatagtc 60 
tgtgctacc 69 



<210> 13 

<211> 330 

<212> DNA 

<213> Homo sapiens 



<400> 13 

caggctgttg tgactcagga atcagcactc accacatcac ctggtgaaac agtcacactc 60 
acttgtcgct caagtactgg ggctgttaca attagtaact atgccaactg ggtccaagaa 120 
aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt 180 
cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca 240 
cagactgagg atgaggcaat ctatttctgt gctctatggt acaacaacca gtttattttc 300 
ggcagtggaa ccaaggtcac tgtcctaggt 330



<210> 14 

<211> 354 

<212> DNA 

<213> Homo sapiens 



<400> 14 

gaggtccagc tgcaacagtc tggacctgtg ctggtgaagc ctggggcttc agtgaagatg 60 
tcctgtaagg cttctggaaa cacattcact gactccttta tgcactggat gaaacagagc 120 
catggaaaga gtcttgagtg gattggaatt attaatcctt ataacggcgg tactagctac 180 
aaccagaaat tcaagggcaa ggccacattg actgttgaca agtcctccag cacagcctac 240 
atggagctca acagcctgac atctgaggac tctgcagtct attactgtgc aagaaacggg 300
gtgcggtact actttgacta ctggggccaa ggcaccactc tcacagtctc ctca 354 



<210> 15 
<211> 8 
<212> PRT 

<213> Mus musculus 



<400> 15 

Ser Ile Ile Asn Phe Glu Lys Leu
1 5 



<210> 16 
<211> 17 
<212> PRT 

<213> Mus musculus 
<400> 16 

Leu Ser Gln Ala Val His Ala Ala His Ala Glu Ile Asn Glu Ala Gly
1 5 10 15

Arg 



<210> 17 
<211> 8858 
<212> DNA 

<213> Homo sapiens 



<400> 17 

gacggatcgg gagatctgct agcccgggtg acctgaggcg cgccggcttc gaatagccag 60
agtaaccttt ttttttaatt ttattttatt ttatttttga gatggagttt ggcgccgatc 120
tcccgatccc ctatggtcga ctctcagtac aatctgctct gatgccgcat agttaagcca 180
gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca aaatttaagc 240
tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg ttaggcgttt 300
tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat tgactagtta 360
ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac 420
ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc 480
aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt 540
ggactattta cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac 600
gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac 660
cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt 720
gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc 780
aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt 840
tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg 900
ggaggtctat ataagcagag ctctctggct aactagagaa cccactgctt actggcttat 960
cgaaattaat acgactcact atagggagac ccaagcttgg taccatggaa gccccagctc 1020
agcttctctt cctcctgcta ctctggctcc cagataccac cggagacatt gttctgactc 1080
agtctccagc caccctgtct gtgactccag gagatagagt ctctctttcc tgcagggcca 1140
gccagagtat tagcgactac ttacactggt atcaacaaaa atcacatgag tctccaaggc 1200
ttctcatcaa atatgcttcc cattccatct ctgggatccc ctccaggttc agtggcagtg 1260
gatcagggtc agatttcact ctcagtatca acagtgtgga acctgaagat gttggaattt 1320
attactgtca acatggtcac agctttccgt ggacgttcgg tggaggcacc aagctggaaa 1380
tcaaacgtaa gtctcgagtc tctagataac cggtcaatcg gtcaatcgat tggaattcta 1440
aactctgagg gggtcggatg acgtggccat tctttgccta aagcattgag tttactgcaa 1500
ggtcagaaaa gcatgcaaag ccctcagaat ggctgcaaag agctccaaca aaacaattta 1560
gaactttatt aaggaatagg gggaagctag gaagaaactc aaaacatcaa gattttaaat 1620
acgcttcttg gtctccttgc tataattatc tgggataagc atgctgtttt ctgtctgtcc 1680
ctaacatgcc cttatccgca aacaacacac ccaagggcag aactttgtta cttaaacacc 1740
atcctgtttg cttctttcct caggaactgt ggctgcacca tctgtcttca tcttcccgcc 1800
atctgatgag cagttgaaat ctggaactgc ctctgttgtg tgcctgctga ataacttcta 1860
tcccagagag gccaaagtac agtggaaggt ggataacgcc ctccaatcgg gtaactccca 1920
ggagagtgtc acagagcagg acagcaagga cagcacctac agcctcagca gcaccctgac 1980
gctgagcaaa gcagactacg agaaacacaa agtctacgcc tgcgaagtca cccatcaggg 2040
cctgagctcg cccgtcacaa agagcttcaa caggggagag tgttagaggg agaagtgccc 2100
ccacctgctc ctcagttcca gcctgacccc ctcccatcct ttggcctctg accctttttc 2160
cacaggggac ctacccctat tgcggtcctc cagctcatct ttcacctcac ccccctcctc 2220
ctccttggct ttaattatgc taatgttgga ggagaatgaa taaataaagt gaatctttgc 2280
acctgtggtt tctctctttc ctcatttaat aattattatc tgttgtttta ccaactactc 2340
aatttctctt ataagggact aaatatgtag tcatcctaag gcacgtaacc atttataaaa 2400
atcatccttc attctatttt accctatcat cctctgcaag acagtcctcc ctcaaaccca 2460
caagccttct gtcctcacag tcccctgggc catggtagga gagacttgct tccttgtttt 2520
cccctcctca gcaagccctc atagtccttt ttaagggtga caggtcttac agtcatatat 2580
cctttgattc aattccctga gaatcaacca aagcaaattt ttcaaaagaa gaaacctgct 2640



ataaagagaa tcattcattg caacatgata taaaataaca acacaataaa agcaattaaa 2700
taaacaaaca atagggaaat gtttaagttc atcatggtac ttagacttaa tggaatgtca 2760
tgccttattt acatttttaa acaggtactg agggactcct gtctgccaag ggccgtattg 2820
agtactttcc acaacctaat ttaatccaca ctatactgtg agattaaaaa cattcattaa 2880
aatgttgcaa aggttctata aagctgagag acaaatatat tctataactc agcaatccca 2940
cttctagatg actgagtgtc cccacccacc aaaaaactat gcaagaatgt tcaaagcagc 3000
tttatttaca aaagccaaaa attggaaata gcccgattgt ccaacaatag aatgagttat 3060
taaactgtgg tatgtttata cattagaata cccaatgagg agaattaaca agctacaact 3120
atacctactc acacagatga atctcataaa aataatgtta cataagagaa actcaatgca 3180
aaagatatgt tctgtatgtt ttcatccata taaagttcaa aaccaggtaa aaataaagtt 3240
agaaatttgg atggaaatta ctcttagctg ggggtgggcg agttagtgcc tgggagaaga 3300
caagaagggg cttctggggt cttggtaatg ttctgttcct cgtgtggggt tgtgcagtta 3360
tgatctgtgc actgttctgt atacacatta tgcttcaaaa taacttcaca taaagaacat 3420
cttataccca gttaatagat agaagaggaa taagtaatag gtcaagacca acgcagctgg 3480
taagtggggg cctgggatca aatagctacc tgcctaatcc tgcccwcttg agccctgaat 3540
gagtctgcct tccagggctc aaggtgctca acaaaacaac aggcctgcta ttttcctggc 3600
atctgtgccc tgtttggcta gctaggagca cacatacata gaaattaaat gaaacagacc 3660
ttcagcaagg ggacagagga cagaattaac cttgcccaga cactggaaac ccatgtatga 3720
acactcacat gtttgggaag ggggaagggc acatgtaaat gaggactctt cctcattcta 3780
tggggcactc tggccctgcc cctctcagct actcatccat ccaacacacc tttctaagta 3840
cctctctctg cctacactct gaaggggttc aggagtaact aacacagcat cccttccctc 3900
aaatgactga caatcccttt gtcctgcttt gtttttcttt ccagtcagta ctgggaaagt 3960
ggggaaggac agtcatggag aaactacata aggaagcacc ttgcccttct gcctcttgag 4020
aatgttgatg agtatcaaat ctttcaaact ttggaggttt gagtaggggt gagactcagt 4080
aatgtccctt ccaatgacat gaacttgctc actcatccct gggggccaaa ttgaacaatc 4140
aaaggcaggc ataatccagt tatgaattct tgcggccgct tgctagcttc acgtgttgga 4200
tccaaccgcg gaagggccct attctatagt gtcacctaaa tgctagagct cgctgatcag 4260
cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct 4320
tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc 4380
attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg 4440
aggattggga agacaatagc aggcatgctg gggatgcggt gggctctatg gcttctgagg 4500
cggaaagaac cagctggggc tctagggggt atccccacgc gccctgtagc ggcgcattaa 4560
gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc 4620
ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccgggcct ctcaaaaaag 4680
ggaaaaaaag catgcatctc aattagtcag caaccatagt cccgccccta actccgccca 4740
tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt 4800
ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag tagtgaggag 4860
gcttttttgg aggcctaggc ttttgcaaaa agcttggaca gctcagggct gcgatttcgc 4920
gccaaacttg acggcaatcc tagcgtgaag gctggtagga ttttatcccc gctgccatca 4980
tggttcgacc attgaactgc atcgtcgccg tgtcccaaaa tatggggatt ggcaagaacg 5040
gagacctacc ctggcctccg ctcaggaacg agttcaagta cttccaaaga atgaccacaa 5100
cctcttcagt ggaaggtaaa cagaatctgg tgattatggg taggaaaacc tggttctcca 5160
ttcctgagaa gaatcgacct ttaaaggaca gaattaatat agttctcagt agagaactca 5220
aagaaccacc acgaggagct cattttcttg ccaaaagttt ggatgatgcc ttaagactta 5280
ttgaacaacc ggaattggca agtaaagtag acatggtttg gatagtcgga ggcagttctg 5340
tttaccagga agccatgaat caaccaggcc accttagact ctttgtgaca aggatcatgc 5400
aggaatttga aagtgacacg tttttcccag aaattgattt ggggaaatat aaacttctcc 5460
cagaataccc aggcgtcctc tctgaggtcc aggaggaaaa aggcatcaag tataagtttg 5520
aagtctacga gaagaaagac taacaggaag atgctttcaa gttctctgct cccctcctaa 5580
agctatgcat ttttataaga ccatgggact tttgctggct ttagatctct ttgtgaagga 5640
accttacttc tgtggtgtga cataattgga caaactacct acagagattt aaagctctaa 5700
ggtaaatata aaatttttaa gtgtataatg tgttaaacta ctgattctaa ttgtttgtgt 5760
attttagatt ccaacctatg gaactgatga atgggagcag tggtggaatg cctttaatga 5820
ggaaaacctg ttttgctcag aagaaatgcc atctagtgat gatgaggcta ctgctgactc 5880
tcaacattct actcctccaa aaaagaagag aaaggtagaa gaccccaagg actttccttc 5940
agaattgcta agttttttga gtcatgctgt gtttagtaat agaactcttg cttgctttgc 6000
tatttacacc acaaaggaaa aagctgcact gctatacaag aaaattatgg aaaaatattc 6060
tgtaaccttt ataagtaggc ataacagtta taatcataac atactgtttt ttcttactcc 6120
acacaggcat agagtgtctg ctattaataa ctatgctcaa aaattgtgta cctttagctt 6180
tttaatttgt aaaggggtta ataaggaata tttgatgtat agtgccttga ctagagatca 6240
taatcagcca taccacattt gtagaggttt tacttgcttt aaaaaacctc ccacacctcc 6300
ccctgaacct gaaacataaa atgaatgcaa ttgttgttgt taacttgttt attgcagctt 6360
ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 6420



tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggatcggct 6480
ggatgatcct ccagcgcggg gatctcatgc tggagttctt cgcccacccc aacttgttta 6540
ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca aataaagcat 6600
ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct tatcatgtct 6660
gtataccgtc gacctctagc tagagcttgg cgtaatcatg gtcatagctg tttcctgtgt 6720
gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag 6780
cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt 6840
tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 6900
gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 6960
ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 7020
caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 7080
aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa 7140
atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 7200
cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 7260
ccgcctttct cccttcggga agcgtggcgc tttctcaatg ctcacgctgt aggtatctca 7320
gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 7380
accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat 7440
cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 7500
cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta tttggtatct 7560
gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac 7620
aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa 7680
aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 7740
actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 7800
taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 7860
gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 7920
tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 7980
ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa 8040
accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 8100
agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 8160
acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 8220
tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 8280
cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 8340
tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 8400
ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg cgaccgagtt 8460
gctcttgccc ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc 8520
tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 8580
ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 8640
gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 8700
cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg 8760
gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg 8820
ttccgcgcac atttccccga aaagtgccac ctgacgtc 8858



<210> 18 
<211> 744 
<212> DNA 

<213> Homo sapiens 
<400> 18 

gaggtgcagc tgttggagtc tgggggaggc ttggtacagc ctggggggtc cctgagactc 60
tcctgtgcag cctctggatt caccttcagt gactactaca tgagctgggt ccgccaggct 120
ccagggaagg ggctggagtg ggtctcagca attagtggta gtggtgggag cacatactac 180
gcagactccc tgaagggccg gttcaccatc tccagagaca attccaagaa cacgctgtat 240
ctgcaaatga acagcctgag agccgaggac acggccgtgt attactgtgc gagagcccct 300
gttgactaca gtaacccctc cggtatggac gtctggggcc aaggtacact ggtcaccgtg 360
agcagcggtg gaggcggttc aggcggaggt ggatccggcg gtggcggatc gcagtctgtg 420
ctgactcagc caccctcagc gtctgggacc cccgggcaga gggtcaccat ctcttgttct 480
ggaagcagct ccaacatcgg aagtaatact gttaactggt atcagcagct cccaggaacg 540
gcccccaaac tcctcatcta tagggataat cagcgaccct caggggtccc tgaccgattc 600
tctggctcca agtctggcac ctcagcctcc ctggccatca gtgggctccg gtccgaggat 660
gaggctgatt attactgtgc agcatgggat gacagcctga atggttgggt gttcggcgga 720
ggaaccaagc tgacggtcct aggt 744



<210> 19 

<211> 630 

<212> DNA 

<213> Homo sapiens 



<400> 19 

ggtctatata agcagagctg ggtacgtcct cacattcagt gatcagcact gaacacagac 60
ccgtcgacgg tgatcaggac tgaacagaga gaactcacca tggagtttgg gctgagctgg 120
ctttttcttg tggctatttt aaaaggtgtc cagtgtgagg tgcagctgtt ggagtctggg 180
ggaggcttgg tacagcctgg ggggtccctg agactctcct gtgcagcctc tggattcacc 240
tttagcagct atgccatgag ctgggtccgc caggctccag ggaaggggct ggagtgggtc 300
tcagctatta gtggtagtgg tggtagcaca tactacgcag actccgtgaa gggccggttc 360
accatctcca gagacaattc caagaacacg ctgtatctgc aaatgaacag cctgagagcc 420
gaggacacgg ccgtatatta ctgtgcgaaa gatggggggt actatggttc ggggagttat 480
gggtactttg actactgggg ccagggaacc ctggtcaccg tctcctcagc tagcaccaag 540
ggcccatcgg tcttccccct ggcaccctcc tccaagagca cctctggggg cacagcggcc 600
ctgggctgcc tggtcaagga ctacttcccc 630



<210> 20 
<211> 8953 
<212> DNA 

<213> Human immunodeficiency virus 
<220> 

<221> modified_base
<222> (7216) 

<223> a, c, t, g, unknown or other 



<400> 20 

ttctctcgac gcaggactcg gcttgctgaa gtgcacacgg caagaggcga agggcggcga 60
ctggtgagta cgccaaaaat attttttgac tagcggaggc tagaaggaga gagatgggtg 120
cgagagcgtc aatattaagt gggggaaaat tagacgattg ggaaaaaatt cggttgaggc 180
cagggggaaa gaaacaatat aggataaaac atctagtatg ggcaagcagg gagctggaca 240
gatttgcact taaccctggc cttctagagt cagcaaaagg ctgtcaacaa atactagtac 300
agctccaacc agctcttcag acaggaacag aagaaattaa atcattatat aatacagtag 360
caaccctcta ttgcgtacat cagaggatag agataaaaga caccaaggaa gctttagaca 420
agatagagga aattcaaaac aaaaacaaac agcagacaca gaaagcagaa actgacaaaa 480
aagacaacag tcaggtcagt caaaattatc ctatagtgca gaatctgcaa gggcaaccgg 540
tacaccaggc cttatcacct agaactttaa atgcatgggt aaaagtgata gaagagaaag 600
ccttcagccc agaagtgata cccatgtttt cagcattatc agaaggagcc accccgcaag 660
atttaaacac catgctaaac acaatagggg gacaccaagc agctatgcaa atgttaaaag 720
atactatcaa tgaggaagct gcagaatggg acagggtaca tccagtacat gcagggcctg 780
ttgcaccagg ccaggtgaga gaaccaaggg gaagtgatat agcaggaact actagtaacc 840
tccaggaaca aataggatgg atgacaggca acccaccgat cccagtagga gaaatttata 900
aaaggtggat aattctggga ctaaataaaa tagtgagaat gtatagccct gtcagcattt 960
tggatataag acaaggacca aaagaacctt tcagagacta tgtagacaga ttctttaaag 1020
ctctaagagc tgagcaagct acacaggatg taaaaaattg gatgacagat accttgttgg 1080
tccaaaatgc aaatccagat tgcaagacca ttttaaaagc attaggatca ggagctacac 1140
tagaagaaat gatgacagca tgtcagggag tgggaggacc tggtcataaa gcaagagttt 1200
tggctgaagc aatgagccaa gtgaccaata caaacataat gatgcaaaga ggtaacttta 1260
gggatcataa aagaattgtt aagtgtttca attgtggcaa acaaggacac atagcaaaaa 1320
actgcagggc ccctagaaaa aagggctgtt ggaaatgtgg aaaggaagga caccaaatga 1380
aagactgcac tgagagacag gctaattttt tagggaagat ttggccttcc agcaaaggga 1440
ggccagggaa ttttctccag agcagaccag agccaacagc cccaccagca gagagcctcg 1500
ggttcggaga ggagatcccc tccccgaaac aggagccgaa ggacaaggaa ctgtatcctc 1560
taacttccct cagatcactc tttggcagcg accccttgtc acaataagaa taggggggca 1620
gctaagggaa gctctattag atacaggagc agacgataca gtattagaag aaatagattt 1680
gccaggaaaa tggaaaccaa aaatgatagg gggaattgga ggttttatca aagtgagaca 1740
gtataatgag gtacccatag aaattgaggg aaaaaaggct ataggtacag tattaatagg 1800
acctacacct gtcaacataa ttggaagaaa catgttgact cagcttggtt gtactttaaa 1860
ttttccaatt agtcctattg aaactgtacc agtaaaatta aagccaggaa tggatggccc 1920



aaaaattaaa caatggccat tgacagaaga aaaaataaaa gcattaacac aaatttgtgc 1980 
agaactggaa gaggagggaa aaatttcaag aattgggcct gaaaatccat ataacacccc 2040
agtatttgcc ataaagaaaa aagacagtac taaatggaga aaattagtag attttagaga 2100 
actcaataaa agaactcaag acttctggga agttcagtta ggaataccac atccagcagg 2160 
gttaaaaaag aaaaaatcag tcacagtact ggatgtgggg gatgcatatt tttcagtccc 2220
tttatatgaa gatttcagga agtatactgc attcactata cctagtataa acaatgagac 2280 
accagggatc agatatcagt acaacgtgct accacaggga tggaaaggat caccagcaat 2340
atttcagtgt agcatgacaa aaatcttaaa accttttaga gaaagaaacc cagaaatagt 2400 
tatctaccag tacatggatg acttgtatgt gggatctgac ttggaaatag aacagcatag 2460 
aagaaaaata aaggagctga gggaacatct attgaagtgg ggattttaca caccagataa 2520
aaaacatcag aaagaacctc catttctttg gatgggatat gagctccatc ctgacaaatg 2580 
gacagtacaa cctatacagc tgccagaaaa agaagattgg actgtcaatg atatacaaaa 2640 
gttagtggga aaactaaatt gggcaagtca aatttatcca ggaattaaaa taaaggaact 2700 
atgtaaactc attagggggg ctaaagcact aacagacata gtaccattga ctagagaagc 2760 
agaattggaa ctggcagaaa acaaggagat tctaaaagaa ccagtacatg gggtatatta 2820
tgacccagca agagaattaa tagcagaagt gcagaaacaa ggactggacc aatggacata 2880
tcaaatttat caggagccat ttaaaaacct gaaaacaggg aaatatgcaa aaaggaggag 2940 
tgcccacact aatgatgtaa agcaattatc acaagtggtg caaaaaatag ccttggaagc 3000
catagtgata tggggaaaaa ctcctaaatt tagactaccc atacaaaagg aaacatggga 3060 
gacatggtgg acagactatt ggcaggccac ctggattcct gagtgggagt ttgtcaatac 3120
cccccctcta gtaaaattat ggtaccaatt agaaaaggaa cccataatgg gagcagaaac 3180 
tttctatgta gatggggcat ctaacaggga aactaaagta ggaaaggcag ggtatgttac 3240 
tgacaaagga agacagaaag taattaccct aactgacaca acaaatcaga agactgaact 3300
acaagccatt tatttagctt tacaggattc agggatagaa gtaaacatag taacagattc 3360
acaatatgca ttggggatta ttcaagcaca accagataag agtgaatcag aattagtcaa 3420
tcaaataata gaggagttaa taaagaagga aaaggtctac ctgtcgtggg taccagcaca 3480 
caaaggaatt ggaggaaatg aacaagtaga taaattagtc agttctggaa tcaggaaagt 3540
gctgtttcta gatgggatag ataaagctca agaagaacat gaaaaatatc atagcaattg 3600
gagagcaatg gctagtgatt ttaatctacc acctgtagta gcaaaagaaa tagtagctag 3660
ctgtgataaa tgtcagctaa agggggaagc catgcatgga caagtagact gtagtccagg 3720
gatatggcaa ttagattgta cacatttaga aggaaaagtt atcctggtag cagttcatgt 3780
agccagtggc tatatagaag cagaagttat cccagcagaa acaggacagg aagcagcatt 3840
ttttatatta aaattagcag gcggatggcc agtaaaagca atacatacag ataatggcag 3900
caacttcacc agtggtgctg tgaaggcagc ctgttggtgg gcagatatca aacaggaatt 3960 
tggaattccc tacaatcccc aaagtcaagg agtagtagaa tctatgaata aagaattaaa 4020
gaaaatcata ggacaggtaa gagaacaagc tgaacacctt aagacagcag tacagatggc 4080 
agtattcata cacaatttta aaagaaaagg ggggattggg ggatacagtg caggggaaag 4140 
aataatagac ataatagcaa cagacataca aactaaagaa ttacaaaaac aaatcacaaa 4200
aattcaaaat tttcgggttt attacaggga cagcagagac ccaatttgga aaggaccagc 4260
aaaactgctc tggaaaggtg aaggggcagt agtaatacaa gacaatagtg aaataaaggt 4320
agtaccaaga agaaaagcaa agatcattag agattatgga aaacagatgg caggtgatga 4380 
ttgtgtggca ggtagacagg atgaggatta acacatggaa aagtttagta aagtaccata 4440 
tgaatgtttc aaagaaagct agacaatggc tgtatagaca tcactatgat agccgtcatc 4500 
caaaaataag ttcagaagta cacatcccac taggagaggc tagattagta gtaacaacat 4560 
attggggtct gcaaacagga gaaagagatt ggcacttggg tcagggagtc tccatagaat 4620 
ggaggcggaa aaggtacaga acacaagtag accctggcct ggcagaccaa ctaattcata 4680 
tgcattactt tgattgtttt tcagactctg ccataaggaa ggccatatta ggacaaatag 4740 
ttagccctag gtgtgactac caagcaggac ataacaaggt aggatctcta caatatctgg 4800 
cattaacagc attaataaaa ccaaaaagga gaaagccacc tttgcctagt gttcagaaac 4860 
tagtagagga tagatggaac aagccccaga agaccaggga ccacagagag agccatacca 4920
tgaatggaca ctagagcttt tggaggagct taaaaatgaa gctgttagac actttcctag 4980 
gccatggctc catggtttag gacagtatgt ctatagcact tatggagata catgggaagg 5040 
agtcgaagcc gtaataagaa tactgcaaca actattgttt attcatttca gaatcgggtg 5100 
ccatcatagc agaataggca ttataccaca gagaagaggg aggaatggag ccagtagatc 5160 
ctaacagaga gccctggaac catccaggaa gtcagcctaa aactgcttgt actaattgtt 5220 
attgtaaaaa gtgttgctat cattgtcaag tgtgctttct acagaagggc ttaggcattt 5280
cctatggcag gaagaagcgg agacaacgac gatcagctcc tcctggcagt aagaatcatc 5340 
aagatcttat accagagcag taagtaactt aattagcata tgtaatggta tctttacaaa 5400 
tagtagcaat agtagcatta atagtagcat ttttccttgc aatatgtgtg tggactatag 5460 
tgtatataga atataagaaa ctgttaagac aaaggaaaat agataagtta attaatagaa 5520
taagagaaag ggcagaagac agtggtaacg agagtgatgg agacacagac gagttggctg 5580 
agcttgtgga gatggggcct catgatcttt ggaatgttaa tgatttgtag tgctagagaa 5640 
aacttgtggg tcacagtcta ttatggggta cctgtatgga gagatgcaaa gaccacttta 5700 



ttttgcgcat ctgatgctaa agcatatagt actgaaaaac ataatgtctg ggctacacat 5760
gcttgtgtac ccacagatcc aaacccacaa gagatgagtc tgccaaatgt aacagaaaat 5820
tttaacatgt ggaaaaatga catggtagac cagatgcagg aagatataat cagtgtatgg 5880
gatgaaagct taaagccatg tgtaaagata acccctctct gtgtcacttt aaattgtagc 5940
gacgtcaata gtaacaatag tacagatagt aatagtagtg caagcaacaa tagtcctgaa 6000
atcatgaaaa actgctcttt caatgtaact acagaaataa gaaataaaag gaagcaagaa 6060
tacgcgcttt tctatagaca agatgtagta ccaattaata gtgacaataa aagttatatt 6120
ctaataaact gtaatacctc agttattaaa caggcttgtc caaaggtgtc ttttcaacca 6180
attcccatac attattgtgc tccagctggt tttgcgattc taaaatgtaa taataagact 6240
ttcaatggaa caggaccatg caaaaatgtc agtacagtac aatgtacaca cggaattaag 6300
ccagtggtat caactcaact actgctaaat ggcagtgtag cagaaggaga cataataatt 6360
agatctgaaa atatctcaga caatgctaaa aacataatag tacaacttaa tgacactgta 6420
gaaattgtgt gtaccagacc taataacaat acaagaaaag gtatacacat gggaccagga 6480
caagtgctct acgcaacagg ggaaataata ggagatataa ggaaagcata ttgtaacatt 6540
agtagaaaag attggaataa cactttacgt agagtagcta aaaaactaag agaacacttt 6600
aataaaacaa tagactttac atcaccctca ggaggggaca tagaaattac aacacatagt 6660
tttaattgtg gaggagaatt tttctattgt aatacatcaa cactgttcaa tagtagttgg 6720
gatgagaata acattaagga cacaaatagt acaaatgaca acacaactat cacaatacca 6780
tgtaaaataa aacaaattgt gagaatgtgg caaagaacag gacaagcaat atatgcccct 6840
cccatcgcag gaaacattac atgcaaatca aatattacag gattattatt gacacgtgat 6900
ggaggaaaca ggaatggcag tgagaatggc actgagacct tcagacctac aggaggaaat 6960
atgaaagata attggagaag tgaattatat aaatataaag tagtagagct tgagccacta 7020
ggagtagcac ccaccaaggc aaaaagaaga gtggtggaga gagaaaaaag agcagtggga 7080
ataggagctg tgttccttgg gttcttggga acagcaggaa gcactatggg cgcagcgtca 7140
ataacgctga cggtacaggt cagacaattg ttgtctggca tagtgcaaca gcaaagcaat 7200
ttgctgaagg ctatanaagc gcaacagcat ctgttgaagc tcactgtctg gggcattaag 7260
cagctccagg caagagtcct ggctgtggaa agatacctaa aggatcaaca gctcctagga 7320
atttggggct gctctggaaa actcatctgc accactaatg tgccctggaa tgctagttgg 7380
agtaataaat cttatgagga catttgggag aacatgacct ggatacaatg ggaaagggaa 7440
attaacaatt acacaggaat aatatacagt ctaattgaag aagcacaaaa ccagcaggaa 7500
actaatgaaa aggacttatt ggcattggac aagtggacaa atttgtggaa ttggtttaac 7560
atatcaaact ggctgtggta cataaaaata ttcataatga taataggagg cttaataggt 7620
ttaagaataa tttttgctgt gcttgctata gtaaatagag ttaggcaggg atactcacct 7680
ttgtcatttc agacccttat tccaaaccca acggaagccg acaggcccgg aggaatcgaa 7740
gaaggaggtg gagagcaagg cagaaccaga tcgattcgat tagtgaacgg attcttagct 7800
cttgcctggg acgacctgcg gagcctgtgc ctcttcagtt accaccgatt gagagacttc 7860
gtcttgattg cagcgaggac tgtgggaact ctgggactca gggggtggga gatcctcaaa 7920
tatctggtga accttgtatg gtattggggg caggaactaa agaatagtgc tattagtttg 7980
cttaatacca cagcaatagc agtagctgaa ggaacagata gaatcataga aatagcacaa 8040
agagctttta gagctattct tcacatacct agaagaataa gacagggttt agaaagagct 8100
ttgctataaa atggggaaca agtggtcaaa aagttggcct caggtaaggg acagaatgag 8160
gcgagctgct cctgctccag cagcagatgg agtgggagca gtgtctcaag atttggctaa 8220
gcatggggca atcacaagca gcaatacagc agctacaaat gatgactgtg cctggctgga 8280
agcacaaaca gaggaggagg ttggatttcc agtcagacct caggtwccat taagaccaat 8340
gacatacaaa ggagcttttg atcttagctt ctttttaaaa gaaaaggggg gactggatgg 8400
gttaatttac tccaagaaaa gacaagagat ccttgatctg tgggttcata acacacaagg 8460
ttacttccct gactggcaaa actacacacc agggccaggg accabatacc cattgacatt 8520
tggatggtgc ttcaagctag taccagttga tccaagcgaa gtagaggaag ctaatgaagg 8580
agagaacaac tgcctgttac accccgcatg ccagcatgga atagaggatg aagaaagaga 8640
agtgctaaag tggaagtttg acagctccct agcacggaga cacatagccc gagagctaca 8700
tccggagttt tacaaagact gctgacaaag aagtttctag cggggacttt ccgctgggga 8760
ctttccaggg gaggtgtggc ctgggcgggg ttggggagtg gctaaccctc agatgctgca 8820
tataagcagc tgcttttcgc ttgtactggg tctctcttgt tagaccagat ctgagcctgg 8880
gagctctctg gctaactagg gaacccactg cttaagcctc aataaagctt gccttgaggg 8940
cgcatgcaag ccg 8953



<210> 21 
<211> 497 
<212> PRT 

<213> Human immunodeficiency virus 



<400> 21 



Met Gly Ala Arg Ala Ser Ile Leu Ser Gly Gly Lys Leu Asp Asp Trp
1 5 10 15

Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Gln Tyr Arg Ile Lys
20 25 30

His Leu Val Trp Ala Ser Arg Glu Leu Asp Arg Phe Ala Leu Asn Pro
35 40 45

Gly Leu Leu Glu Ser Ala Lys Gly Cys Gln Gln Ile Leu Val Gln Leu
50 55 60

Gln Pro Ala Leu Gln Thr Gly Thr Glu Glu Ile Lys Ser Leu Tyr Asn
65 70 75 80

Thr Val Ala Thr Leu Tyr Cys Val His Gln Arg Ile Glu Ile Lys Asp
85 90 95

Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Ile Gln Asn Lys Asn Lys
100 105 110

Gln Gln Thr Gln Lys Ala Glu Thr Asp Lys Lys Asp Asn Ser Gln Val
115 120 125

Ser Gln Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Pro Val His
130 135 140

Gln Ala Leu Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Ile Glu
145 150 155 160

Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser
165 170 175

Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Ile Gly
180 185 190

Gly His Gln Ala Ala Met Gln Met Leu Lys Asp Thr Ile Asn Glu Glu
195 200 205

Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro Val Ala
210 215 220

Pro Gly Gln Val Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr
225 230 235 240

Ser Asn Leu Gln Glu Gln Ile Gly Trp Met Thr Gly Asn Pro Pro Ile
245 250 255

Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys
260 265 270

Ile Val Arg Met Tyr Ser Pro Val Ser Ile Leu Asp Ile Arg Gln Gly
275 280 285

Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Ala Leu
290 295 300

Arg Ala Glu Gln Ala Thr Gln Asp Val Lys Asn Trp Met Thr Asp Thr
305 310 315 320

Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala
325 330 335

Leu Gly Ser Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly
340 345 350

Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser
355 360 365

Gln Val Thr Asn Thr Asn Ile Met Met Gln Arg Gly Asn Phe Arg Asp
370 375 380

His Lys Arg Ile Val Lys Cys Phe Asn Cys Gly Lys Gln Gly His Ile
385 390 395 400

Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly
405 410 415

Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn Phe
420 425 430

Leu Gly Lys Ile Trp Pro Ser Ser Lys Gly Arg Pro Gly Asn Phe Leu
435 440 445

Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Ala Glu Ser Leu Gly Phe
450 455 460

Gly Glu Glu Ile Pro Ser Pro Lys Gln Glu Pro Lys Asp Lys Glu Leu
465 470 475 480

Tyr Pro Leu Thr Ser Leu Arg Ser Leu Phe Gly Ser Asp Pro Leu Ser
485 490 495

Gln



<210> 22 
<211> 1001 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 22 

Phe Phe Arg Glu Asp Leu Ala Phe Gln Gln Arg Glu Ala Arg Glu Phe
1 5 10 15

Ser Pro Glu Gln Thr Arg Ala Asn Ser Pro Thr Ser Arg Glu Pro Arg
20 25 30

Val Arg Arg Gly Asp Pro Leu Pro Glu Thr Gly Ala Glu Gly Gln Gly
35 40 45

Thr Val Ser Ser Asn Phe Pro Gln Ile Thr Leu Trp Gln Arg Pro Leu
50 55 60

Val Thr Ile Arg Ile Gly Gly Gln Leu Arg Glu Ala Leu Leu Asp Thr
65 70 75 80

Gly Ala Asp Asp Thr Val Leu Glu Glu Ile Asp Leu Pro Gly Lys Trp
85 90 95

Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val Arg Gln
100 105 110

Tyr Asn Glu Val Pro Ile Glu Ile Glu Gly Lys Lys Ala Ile Gly Thr
115 120 125

Val Leu Ile Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn Met Leu
130 135 140

Thr Gln Leu Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile Glu Thr
145 150 155 160

Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Ile Lys Gln
165 170 175

Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Thr Gln Ile Cys Ala
180 185 190

Glu Leu Glu Glu Glu Gly Lys Ile Ser Arg Ile Gly Pro Glu Asn Pro
195 200 205

Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr Lys Trp
210 215 220

Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln Asp Phe
225 230 235 240

Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys Lys Lys
245 250 255

Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro
260 265 270

Leu Tyr Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro Ser Ile
275 280 285

Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu Pro Gln
290 295 300

Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr Lys Ile
305 310 315 320

Leu Lys Pro Phe Arg Glu Arg Asn Pro Glu Ile Val Ile Tyr Gln Tyr
325 330 335

Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Glu Gln His Arg
340 345 350

Arg Lys Ile Lys Glu Leu Arg Glu His Leu Leu Lys Trp Gly Phe Tyr
355 360 365

Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp Met Gly
370 375 380

Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Gln Leu Pro
385 390 395 400

Glu Lys Glu Asp Trp Thr Val Asn Asp Ile Gln Lys Leu Val Gly Lys
405 410 415

Leu Asn Trp Ala Ser Gln Ile Tyr Pro Gly Ile Lys Ile Lys Glu Leu
420 425 430

Cys Lys Leu Ile Arg Gly Ala Lys Ala Leu Thr Asp Ile Val Pro Leu
435 440 445

Thr Arg Glu Ala Glu Leu Glu Leu Ala Glu Asn Lys Glu Ile Leu Lys
450 455 460

Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ala Arg Glu Leu Ile Ala
465 470 475 480

Glu Val Gln Lys Gln Gly Leu Asp Gln Trp Thr Tyr Gln Ile Tyr Gln
485 490 495

Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Lys Arg Arg Ser
500 505 510

Ala His Thr Asn Asp Val Lys Gln Leu Ser Gln Val Val Gln Lys Ile
515 520 525

Ala Leu Glu Ala Ile Val Ile Trp Gly Lys Thr Pro Lys Phe Arg Leu
530 535 540

Pro Ile Gln Lys Glu Thr Trp Glu Thr Trp Trp Thr Asp Tyr Trp Gln
545 550 555 560

Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro Leu Val
565 570 575

Lys Leu Trp Tyr Gln Leu Glu Lys Glu Pro Ile Met Gly Ala Glu Thr
580 585 590

Phe Tyr Val Asp Gly Ala Ser Asn Arg Glu Thr Lys Val Gly Lys Ala
595 600 605

Gly Tyr Val Thr Asp Lys Gly Arg Gln Lys Val Ile Thr Leu Thr Asp
610 615 620

Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile Tyr Leu Ala Leu Gln
625 630 635 640

Asp Ser Gly Ile Glu Val Asn Ile Val Thr Asp Ser Gln Tyr Ala Leu
645 650 655

Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu Val Asn
660 665 670

Gln Ile Ile Glu Glu Leu Ile Lys Lys Glu Lys Val Tyr Leu Ser Trp
675 680 685

Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp Lys Leu
690 695 700

Val Ser Ser Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile Asp Lys
705 710 715 720

Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala Met Ala
725 730 735

Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val Ala Ser
740 745 750

Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln Val Asp
755 760 765

Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu Gly Lys
770 775 780

Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu Ala Glu
785 790 795 800

Val Ile Pro Ala Glu Thr Gly Gln Glu Ala Ala Phe Phe Ile Leu Lys
805 810 815

Leu Ala Gly Gly Trp Pro Val Lys Ala Ile His Thr Asp Asn Gly Ser
820 825 830

Asn Phe Thr Ser Gly Ala Val Lys Ala Ala Cys Trp Trp Ala Asp Ile
835 840 845

Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly Val Val
850 855 860

Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val Arg Glu
865 870 875 880

Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe Ile His
885 890 895

Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly Glu Arg
900 905 910

Ile Ile Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu Gln Lys
915 920 925

Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg
930 935 940

Asp Pro Ile Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly Glu Gly
945 950 955 960

Ala Val Val Ile Gln Asp Asn Ser Glu Ile Lys Val Val Pro Arg Arg
965 970 975

Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly Asp Asp
980 985 990

Cys Val Ala Gly Arg Gln Asp Glu Asp
995 1000



<210> 23 
<211> 101 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 23 

Met Glu Pro Val Asp Pro Asn Arg Glu Pro Trp Asn His Pro Gly Ser
1 5 10 15

Gln Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Tyr
20 25 30

His Cys Gln Val Cys Phe Leu Gln Lys Gly Leu Gly Ile Ser Tyr Gly
35 40 45

Arg Lys Lys Arg Arg Gln Arg Arg Ser Ala Pro Pro Gly Ser Lys Asn
50 55 60

His Gln Asp Leu Ile Pro Glu Gln Pro Leu Phe Gln Thr Gln Arg Lys
65 70 75 80

Pro Thr Gly Pro Glu Glu Ser Lys Lys Glu Val Glu Ser Lys Ala Glu
85 90 95

Pro Asp Arg Phe Asp
100



<210> 24 
<211> 116 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 24 

Met Ala Gly Arg Ser Gly Asp Asn Asp Asp Gln Leu Leu Leu Ala Val
1 5 10 15

Arg Ile Ile Lys Ile Leu Tyr Gln Ser Asn Pro Tyr Ser Lys Pro Asn
20 25 30

Gly Ser Arg Gln Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Ala Arg
35 40 45

Gln Asn Gln Ile Asp Ser Ile Ser Glu Arg Ile Leu Ser Ser Cys Leu
50 55 60

Gly Arg Pro Ala Glu Pro Val Pro Leu Gln Leu Pro Pro Ile Glu Arg
65 70 75 80

Leu Arg Leu Asp Cys Ser Glu Asp Cys Gly Asn Ser Gly Thr Gln Gly
85 90 95

Val Gly Asp Pro Gln Ile Ser Gly Glu Pro Cys Met Val Leu Gly Ala
100 105 110

Gly Thr Lys Glu
115



<210> 25 
<211> 850 
<212> PRT 

<213> Human immunodeficiency virus 
<220> 

<221> MOD_RES 
<222> (554) 

<223> Variable amino acid 
<400> 25 

Met Glu Thr Gln Thr Ser Trp Leu Ser Leu Trp Arg Trp Gly Leu Met
1 5 10 15

Ile Phe Gly Met Leu Met Ile Cys Ser Ala Arg Glu Asn Leu Trp Val
20 25 30

Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Asp Ala Lys Thr Thr Leu
35 40 45

Phe Cys Ala Ser Asp Ala Lys Ala Tyr Ser Thr Glu Lys His Asn Val
50 55 60

Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met
65 70 75 80

Ser Leu Pro Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met
85 90 95

Val Asp Gln Met Gln Glu Asp Ile Ile Ser Val Trp Asp Glu Ser Leu
100 105 110

Lys Pro Cys Val Lys Ile Thr Pro Leu Cys Val Thr Leu Asn Cys Ser
115 120 125

Asp Val Asn Ser Asn Asn Ser Thr Asp Ser Asn Ser Ser Ala Ser Asn
130 135 140

Asn Ser Pro Glu Ile Met Lys Asn Cys Ser Phe Asn Val Thr Thr Glu
145 150 155 160

Ile Arg Asn Lys Arg Lys Gln Glu Tyr Ala Leu Phe Tyr Arg Gln Asp
165 170 175

Val Val Pro Ile Asn Ser Asp Asn Lys Ser Tyr Ile Leu Ile Asn Cys
180 185 190

Asn Thr Ser Val Ile Lys Gln Ala Cys Pro Lys Val Ser Phe Gln Pro
195 200 205

Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
210 215 220

Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
225 230 235 240

Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
245 250 255

Leu Asn Gly Ser Val Ala Glu Gly Asp Ile Ile Ile Arg Ser Glu Asn
260 265 270

Ile Ser Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Asp Thr Val
275 280 285

Glu Ile Val Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His
290 295 300

Met Gly Pro Gly Gln Val Leu Tyr Ala Thr Gly Glu Ile Ile Gly Asp
305 310 315 320

Ile Arg Lys Ala Tyr Cys Asn Ile Ser Arg Lys Asp Trp Asn Asn Thr
325 330 335

Leu Arg Arg Val Ala Lys Lys Leu Arg Glu His Phe Asn Lys Thr Ile
340 345 350

Asp Phe Thr Ser Pro Ser Gly Gly Asp Ile Glu Ile Thr Thr His Ser
355 360 365

Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Thr Leu Phe
370 375 380

Asn Ser Ser Trp Asp Glu Asn Asn Ile Lys Asp Thr Asn Ser Thr Asn
385 390 395 400

Asp Asn Thr Thr Ile Thr Ile Pro Cys Lys Ile Lys Gln Ile Val Arg
405 410 415

Met Trp Gln Arg Thr Gly Gln Ala Ile Tyr Ala Pro Pro Ile Ala Gly
420 425 430

Asn Ile Thr Cys Lys Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
435 440 445

Gly Gly Asn Arg Asn Gly Ser Glu Asn Gly Thr Glu Thr Phe Arg Pro
450 455 460

Thr Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
465 470 475 480

Lys Val Val Glu Leu Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys
485 490 495

Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
500 505 510

Phe Leu Gly Phe Leu Gly Thr Ala Gly Ser Thr Met Gly Ala Ala Ser
515 520 525

Ile Thr Leu Thr Val Gln Val Arg Gln Leu Leu Ser Gly Ile Val Gln
530 535 540

Gln Gln Ser Asn Leu Leu Lys Ala Ile Xaa Ala Gln Gln His Leu Leu
545 550 555 560

Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala
565 570 575

Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys
580 585 590

Ser Gly Lys Leu Ile Cys Thr Thr Asn Val Pro Trp Asn Ala Ser Trp
595 600 605

Ser Asn Lys Ser Tyr Glu Asp Ile Trp Glu Asn Met Thr Trp Ile Gln
610 615 620

Trp Glu Arg Glu Ile Asn Asn Tyr Thr Gly Ile Ile Tyr Ser Leu Ile
625 630 635 640

Glu Glu Ala Gln Asn Gln Gln Glu Thr Asn Glu Lys Asp Leu Leu Ala
645 650 655

Leu Asp Lys Trp Thr Asn Leu Trp Asn Trp Phe Asn Ile Ser Asn Trp
660 665 670

Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Ile Gly Gly Leu Ile Gly
675 680 685

Leu Arg Ile Ile Phe Ala Val Leu Ala Ile Val Asn Arg Val Arg Gln
690 695 700

Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Ile Pro Asn Pro Thr Glu
705 710 715 720

Ala Asp Arg Pro Gly Gly Ile Glu Glu Gly Gly Gly Glu Gln Gly Arg
725 730 735

Thr Arg Ser Ile Arg Leu Val Asn Gly Phe Leu Ala Leu Ala Trp Asp
740 745 750

Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe
755 760 765



Val Leu Ile Ala Ala Arg Thr Val Gly Thr Leu Gly Leu Arg Gly Trp
770 775 780

Glu Ile Leu Lys Tyr Leu Val Asn Leu Val Trp Tyr Trp Gly Gln Glu
785 790 795 800

Leu Lys Asn Ser Ala Ile Ser Leu Leu Asn Thr Thr Ala Ile Ala Val
805 810 815

Ala Glu Gly Thr Asp Arg Ile Ile Glu Ile Ala Gln Arg Ala Phe Arg
820 825 830

Ala Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala
835 840 845

Leu Leu
850



<210> 26 
<211> 204 
<212> PRT 

<213> Human immunodeficiency virus 
<220> 

<221> MOD_RES 
<222> (132) 

<223> Variable amino acid 
<400> 26 

Met Gly Asn Lys Trp Ser Lys Ser Trp Pro Gln Val Arg Asp Arg Met
1 5 10 15

Arg Arg Ala Ala Pro Ala Pro Ala Ala Asp Gly Val Gly Ala Val Ser
20 25 30

Gln Asp Leu Ala Lys His Gly Ala Ile Thr Ser Ser Asn Thr Ala Ala
35 40 45

Thr Asn Asp Asp Cys Ala Trp Leu Glu Ala Gln Thr Glu Glu Glu Val
50 55 60

Gly Phe Pro Val Arg Pro Gln Val Pro Leu Arg Pro Met Thr Tyr Lys
65 70 75 80

Gly Ala Phe Asp Leu Ser Phe Phe Leu Lys Glu Lys Gly Gly Leu Asp
85 90 95

Gly Leu Ile Tyr Ser Lys Lys Arg Gln Glu Ile Leu Asp Leu Trp Val
100 105 110

His Asn Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr Pro Gly
115 120 125

Pro Gly Thr Xaa Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val
130 135 140

Pro Val Asp Pro Ser Glu Val Glu Glu Ala Asn Glu Gly Glu Asn Asn
145 150 155 160

Cys Leu Leu His Pro Ala Cys Gln His Gly Ile Glu Asp Glu Glu Arg
165 170 175

Glu Val Leu Lys Trp Lys Phe Asp Ser Ser Leu Ala Arg Arg His Ile
180 185 190

Ala Arg Glu Leu His Pro Glu Phe Tyr Lys Asp Cys
195 200



<210> 27 
<211> 158 
<212> PRT 

<213> Human papillomavirus 
<400> 27 

Met Ala Arg Phe Glu Asp Pro Thr Arg Arg Pro Tyr Lys Leu Pro Asp
1 5 10 15

Leu Cys Thr Glu Leu Asn Thr Ser Leu Gln Asp Ile Glu Ile Thr Cys
20 25 30

Val Tyr Cys Lys Thr Val Leu Glu Leu Thr Glu Val Phe Glu Phe Ala
35 40 45

Phe Lys Asp Leu Phe Val Val Tyr Arg Asp Ser Ile Pro His Ala Ala
50 55 60

Cys His Lys Cys Ile Asp Phe Tyr Ser Arg Ile Arg Glu Leu Arg His
65 70 75 80

Tyr Ser Asp Ser Val Tyr Gly Asp Thr Leu Glu Lys Leu Thr Asn Thr
85 90 95

Gly Leu Tyr Asn Leu Leu Ile Arg Cys Leu Arg Cys Gln Lys Pro Leu
100 105 110

Asn Pro Ala Glu Lys Leu Arg His Leu Asn Glu Lys Arg Arg Phe His
115 120 125

Lys Ile Ala Gly His Tyr Arg Gly Gln Cys His Ser Cys Cys Asn Arg
130 135 140

Ala Arg Gln Glu Arg Leu Gln Arg Arg Arg Glu Thr Gln Val
145 150 155



<210> 28 
<211> 105 
<212> PRT 

<213> Human papillomavirus 
<400> 28 

Met His Gly Pro Lys Ala Thr Leu Gln Asp Ile Val Leu His Leu Glu
1 5 10 15

Pro Gln Asn Glu Ile Pro Val Asp Leu Leu Cys His Glu Gln Leu Ser
20 25 30

Asp Ser Glu Glu Glu Asn Asp Glu Ile Asp Gly Val Asn His Gln His
35 40 45

Leu Pro Ala Arg Arg Ala Glu Pro Gln Arg His Thr Met Leu Cys Met
50 55 60

Cys Cys Lys Cys Glu Ala Arg Ile Glu Leu Val Val Glu Ser Ser Ala
65 70 75 80

Asp Asp Leu Arg Ala Phe Gln Gln Leu Phe Leu Lys Thr Leu Ser Phe
85 90 95

Val Cys Pro Trp Cys Ala Ser Gln Gln
100 105



<210> 29 
<211> 843 
<212> DNA 

<213> Human papillomavirus 



<400> 29 

cggtgtatat aaaagatgtg agaaacgcac cacaatacta tggcgcgctt tgaggatcca 60
acacggcgac cctacaagct acctgatctg tgcacggaac tgaacacttc actgcaagac 120
atagaaataa cctgtgtata ttgcaagaca gtattggaac ttacagaggt atttgaattt 180
gcattcaaag atttatttgt ggtgtataga gacagtatac cgcatgctgc atgccataaa 240
tgtatagatt tttattctag aattagagaa ttaagacatt attcagactc tgtgtatgga 300
gacacattag aaaaactaac taacactggg ttatacaatt tattaataag gtgcctgcgg 360
tgccagaaac cgttgaatcc agcagaaaaa cttagacacc ttaatgaaaa acgacgattc 420
cacaaaatag ctgggcacta tagaggccag tgccattcgt gctgcaaccg agcacgacag 480
gagagactcc aacgacgcag agaaacacaa gtataatatt aagtatgcat ggacctaagg 540
caacattgca agacattgta ttgcatttag agcctcaaaa tgaaattccg gttgaccttc 600
tatgtcacga gcaattaagc gactcagagg aagaaaacga tgaaatagat ggagttaatc 660
atcaacattt accagcccga cgagccgaac cacaacgtca cacaatgttg tgtatgtgtt 720
gtaagtgtga agctagaatt gagctagtag tagaaagctc agcagacgac cttcgagcat 780
tccagcagct gtttctgaaa accctgtcct ttgtgtgtcc gtggtgtgca tcccagcagt 840
aag 843



<210> 30 
<211> 11835 
<212> DNA 

<213> Epstein Barr virus 
<400> 30 

ggccgctgtt cacctaaagt gacgcaaggt ctgtcagccg ccagggtccg tttaccaggc 60 
tttcaggtgt ggaatttaga tagagtgggt gtgtgctctt gtttaattac accaagatca 120 
ccaccctcta tccatatccc acaattgata aacctccgca tgtccaacca ccacgttgaa 180 
caggatgtgg caccctaaga ggacgcaggc atacaaggtt attacccagt ccttgtatgc 240 
ctggtgtccc cttagtggga cgcaggccta ggtagcatca tttacactaa aagcagtgac 300
cttgttggta ctttaaggtt ggtccaatcc ataggctttt tttgtgaaaa cccggggatc 360 
ggactagcct tagagtaact caaggccaag catttcacac ctgcaaatgc accatgtaac 420 
cacagatcta aactgaaagt tgcagcttta gatggcaagg aaacttgggt ttcaggcata 480 
gaaagcctgg ctcactatag cagcccatgt ttgttccagg gtgggggaaa ggcacgtgcc 540 
cttagaaaac ttagctgcaa aaattctatt gtgttgggag agcctctata tctaaaggcc 600 
tttcctcaca atacaaatgt tactaacgtc tgccctctgg agacctgcta tgtggctaga 660 
cgtatggcct acccaagacg ttgggggtct cgggtaggcc atgattcttc caggcatagg 720 
ttacaaccag tcactgctat caagcctact cagttcccaa cgcagcacat accccccgcc 780 
tctcctgcca tgaggactta tggcagtgtt tactgttctg cttttactct tggaccaggc 840 
tgtcattcta tcagaataac aggggaagca aggccccctg cttcagcggg acacgtgttt 900 
ctagaatctc ggagccaata actacctgcc cctctaatct gtatgctgca tgaaaaacca 960 
catacacgtg atgtaagttt agccagttta ttgttacacc aatgccccga aagtctcccc 1020 
ctgtcccttt gggtctcagg acccagccct ggagctcggg ggcggccggg tggcccaccg 1080 
ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg 1140 
gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg 1200 
ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc 1260 
ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg 1320
tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc 1380
ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg 1440
tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt 1500
ccgctgcccc gctccggcgg ggggtggccg gctgcagccg ggtccggggt tccggccctg 1560
gagctcgggg ggcggccggg tggcccaccg ggtccgctgg gtccgctgcc ccgctccggc 1620
ggggggtggc cggctgcagc cgggtccggg gttccggccc tggagctcgg ggggcggccg 1680
ggtggcccac cgggtccgct gggtccgctg ccccgctccg gcggggggtg gccggctgca 1740
gccgggtccg gggttccggc cctggagctc ggggggcggc cgggtggccc accgggtccg 1800
ctgggtccgc tgccccgctc cggcgggggg tggccggctg cagccgggtc cggggttccg 1860
gccctggagc tcggggggcg gccgggtggc ccaccgggtc cgctgggtcc gctgccccgc 1920
tccggcgggg ggtggccggc tgcagccggg tccggggttc cggccctgga gctcgggggg 1980
cggccgggtg gcccaccggg tccgctgggt ccgctgcccc gctccggcgg ggggtggccg 2040
gctgcagccg ggtccggggt tccggccctg gagctcgggg ggcggccggg tggcccaccg 2100
ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg 2160
gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg 2220
ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc 2280
ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg 2340
tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc 2400
ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg 2460
tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt 2520
ccgctgcccc gctccggcgg ggggtggccg gctgcagccg ggtccggggt tccggccctg 2580
gagctcgggg ggcggccggg tggcccaccg ggtccgctgg gtccgctgcc ccgctccggc 2640
ggggggtggc cggctgcagc cgggtccggg gttccggccc tggagctcgg ggggcggccg 2700
ggtggcccac cgggtccgct gggtccgctg ccccgctccg gcggggggtg gccggctgca 2760
gccgggtccg gggttccggc cctggagctc ggggggcggc cgggtggccc accgggtccg 2820
ctgggtccgc tgccccgctc cggcgggggg tggccggctg cagccgggtc cggggttccg 2880
gccctggagc tcggggggcg gccgggtggc ccaccgggtc cgctgggtcc gctgccccgc 2940
tccggcgggg ggtggccggc tgcagccggg tccggggttc cggccctgga gctcgggggg 3000
cggccgggtg gcccaccggg tccgctgggt ccgctgcccc gctccggcgg ggggtggccg 3060
gctgcagccg ggtccggggt tccggccctg gagctcgggg ggcggccggg tggcccaccg 3120
ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg 3180
gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg 3240
ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc 3300
ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg 3360
tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc 3420
ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg 3480
tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt 3540
ccgctgcccc gctccggcgg ggatgggggt gcgctcccag gccggaccct ggtgccaggc 3600
agggaccccg cgccacccgc ttcatggggg gggaggccgc cgcaaggacg ccgggccggc 3660
tgggaggtgt gcaccccccg agcgtctgga cgacgctggc gagccgggcc agctcgcctt 3720
cttttatcct ctttttgggg tctctgtgca ataccttaag gtttgctcag gagtgggggg 3780
cttctcattg gttaattcag gtgtgtgatt ttagcccgtt gggttacatt aaggtgtgta 3840
accaggtggg tggtacctgg aggtcattct attgggataa cgagaggagg aggggctaga 3900
ggcccgcgag atttggggta ggcggagcct caggagggtc ccctccatag ggttgaacca 3960
ggagggggag gatcgggctc cgccccgata tacctagtgg gtggagccta gaggtaggta 4020
tccatagggt tccattatcc tggaggtatc ctaagctccg cccctatata ccaggtgggt 4080
ggagctaggt aggattcagc taggttccta ctggggtacc cccctaccct accttaaggt 4140
gcgccaccct tcctccttcc gttttaatgg tagaataacc tataggttat taacctagtg 4200
gtggaatagg gtattgcagc tgggtatata cctataggta tatagaacct agaggaaggg 4260
aaccctatag tgtaatccct ccccccccta cccccccctc ccttacggtt gcctgagccc 4320
atcccccacc ccagcacccc ggggtgacgt ggcaccccgc gtgccttact gacttgtcac 4380
ctttgcacat ttggtcagct gaccgatgct cgccacttcc tgggtcatga cctggcctgt 4440
gccttgtccc atggacaatg tccctccagc gtggtggctg cctttgggat gcatcacttt 4500
gagccactaa gcccccgttg ctcgccttgc ctgcctcacc atgacacact aagcccctgc 4560
taatccatga gccccgcctt taggaagcac cacgtcccgg ggacggaagc tggattttgg 4620
ccagtcttca attttgggga gtggttttgt gtgagccgga agttggcaat ggggtgaggg 4680
tggcgctggt taagctgacg acctcccaag gtctctcacc ctgggtacac aggtggggcg 4740
gcagcctcta actttggctg tggcctctat ttcctccctt tcctagccag ggccatgtgt 4800
tcctgcatgt ctacttgcct cctgtggtgg cagagcttgg ccctgggccc aacccccgcc 4860
ttgggagcct gtaggggcca acacccttgg tttgtttgtg ttcctgtttg ctggcaactt 4920
actggcagcc gagcagattc taatgggcgc ccgccttctt tctctcttgt tttattaata 4980
gaatctcagc caggacctat acctgagact tcaaagtctg gtcctgggtt ctgagacccc 5040
caagatttgt catgcacacc tgcacacctg ttggtattgg gtttctattc ttgagtgtga 5100
aagtttgtaa aaaaattcat aaaatgtcac taattcctct tacctgttta gggtattgtg 5160
caattcttca gcctgcctat tttcaatttg cctaaggtgg caatttaaga tgtggttaat 5220
taaccatttt cctgtctgac accactgcat gggcaaccgg gttccatggc acatttagag 5280
ataaacatag atgtcttgtc ttgctcatgt gcagaggagg gggtgttggt gtgcaatata 5340
gtttctggat tccaaattga gttgggggtg ctattttcac tatggaatta aattactgac 5400
attagacagt ggacaccggg ctatatgtgg ggatgtctgt ggcttgtcat ttcctcttag 5460
aaggtaatcc cccatcttaa cttcccttta aattgtgatg caagccctgg gttatttata 5520
gaatgattat ctaggtttga tagtctgaag gctgggcaga gaatgtttgt aatttttatt 5580
caccttcttt accccccacg agtatccagt tctagaagat ctcctgatat cccgggctgc 5640
cattattccc ttgagtgtta tagcttcctc ttaacttaag caagagctcc aggatgttag 5700
cttttttggt ggggctggtt gtcaggaaga ggttccagtg ttgtccttta tttttagatg 5760
ttagctttgt gttaggttag tatgggctgg gtattcacta gtgaaggcaa ctaacacagt 5820
tagacgtgct agttgtgccc actggtgttt atccggtccc aaatgtcacc acagaacaca 5880
gggggctgga tttggcagca gcacttgtgc ttttgttgat ttttacccgt gtatcagagt 5940
gggggatgct agccaattta gcttcccctc cccttaacag ggggtctcgc ggggtgccaa 6000
ttgtcgcctg ccttcccccg cttccccttg ttaacttata gcatgatagg taggtcacct 6060
aacgtggaag cctggtgggt gatccttcct cggtagggag cgcttagggc tgttgagctc 6120
aacagcccca cctgggtaaa atgtatgttc taaagagtta cccaattata acaaaactgt 6180
tgtagggtaa cgaagacctg atggaagtgg tattgttgcc gttgaaagac gggtgtcctg 6240
gctcaagttc gcacttccta tacagtgtta aagccttgta tcggaagttt gggcttcgtc 6300
ccagtgtact cgataatgtc gactgctgcg aaaggtttgg accgtcttcc agtaggtgtt 6360
gggggtccca aatcacgagg ttaggcaggt gcacttggct ctttaggagg gacccttaag 6420
ccagacaatg tagtgcccct tttttttgca aattggcctt attattaatt tcttgttaac 6480
actaattctg ttctatgacc ctgtgttttt cagatgccgt tgaacgtgtc actgagctga 6540
atttggacgc agctacttga cctttgcccc cgtgcctcca gcgctgataa gtgctgcgtc 6600
cactttgtgt tacaggtggg ccaaacctcc agaatatcaa ttggtggggc cttggtgggc 6660
tgcataaggc agtaggtttg aggtgaccta cttggaccat gtggatccag tgtcctgatc 6720
ctggaccttg actatgaaac aattctaaaa aaatgcatca tagtccagtg tccagggaca 6780
gtgcactcgg aagtctcatc atctccgttt gtgtgtttag tgtggccagt acggccaccc 6840
ctgtgccacg ccctggcatg ctgctgacat ctggccgcca atttcagcgg gcccttttcc 6900
cccttgttca ccccatagca agaagggtag gttacatggg tattttccca tcagcacctg 6960
actggccggt gcaattagag gagagggcaa caacgcaagg ctgttgtttt atttgggtta 7020
caagagctgc ggcggtcgat gggttcactg attacggttt cctagattgt acagatgaac 7080
tagaactgtc acaatctatg gggtcgtaga cagtgtgctt accagacttc catggaagat 7140
gtgaatttgc tgctagctat atgggtggtg ctatgggctc cctagggact catgtagtgg 7200
ggctttgtga tagctaatga atgtggcagc tgttgtttgt actggaccct gaattggaaa 7260
cagtaacttg gattctgtaa cacttcatgg gtcccgtagt gacaactatg ctgaatatct 7320
tgaatatggg aggagggggg ctttgggttc cattgtgtgc cctttcctgg ccaacgtgag 7380
ggtcctagtg ttatagggcg tggcagtttt cttgagggct aataacccgg gtgaggcggt 7440
tgtcacaggt gctagaccct ggagttgaac cagtaccact cggttacaaa gtcatggtct 7500
agtagttgtg accctgcaaa gctacgtggg gatgagcagc cagggacttt ggttggcaag 7560
cagacaggcg gcgcattgga accccagagg agtgtcccgg ggccacctct ttggttctgt 7620
acatattttg ttattgtaca taaccatgga gttggctgtg gtgcactcca tctggtaagg 7680
gggctggtgc ggacgcctgt gtttagtcta tgccaatgtt tacctgcctt gggttactat 7740
tccaaacgac cacacctttg aggacacctg gagccctgat cattctcggc ttttactgcc 7800
acctggcttc tgttgggtca gacagtttgg tgcgctagtt gtgtgcttag cagcaacgca 7860
caccaggctg actgccttag cagtgtggcc ctttattgtg gcatcctaag gagggattct 7920
ggagtgcctt tcgcgtgaag catgccctga gacgtactcg agttaggact taatcgctcc 7980
tgtgccgctg gatgagggag cgccaatttg tacatcctag ctctggccat agagttagcc 8040
cacccttgtg tctccctttg gcctttgcgg tgccaatttc cggtggtttc ccttttccgc 8100
ccgtttatcc aatagcatgt aagagaggtt gcctagattt ggcaactttg agggaacgtt 8160
ccgtgtagct ggtgacctaa cacccgccca tcaccaccgg acagattctg aacttgtcct 8220
gtggtgtttg gtgtggtttt ggggtacgca ggagtacgtt ggaatgcttt ggagccgaga 8280
gggatgggcc cgcttgtgcg cttatgtgtt acacggtgcc aataaccggc ccggtgcggc 8340
tgccccgtga cccgtgggcc ttaccttcct ggccatcggg ggaccctggt gctagggtcc 8400
cttgtgttgc tttctgccat aggggggaaa gcatcgcctt cagaattggc tgctccgttg 8460
gaacatttga ggcctactgt atccgtgtcc tgacaacatt ccccgcaaac atgacatggg 8520
ttaatttaaa catgttttgt ttgcttggga atgctcttag ggcctggaag cttgtcattg 8580
gattcatcgt ttcctgaact acaggcgtag ggcctattgt agcaggcatg tcttcattcc 8640
tgcgtaccga atggcatgaa ggcacagcct gttaccattg gcaccttttt tccatgtaaa 8700
cctccgtgat cctgggtcct ttggagactc aagtgtgaat ttgttttggt gttcggcgcc 8760
agggtatctc gacgttggaa tgtcaactca acttgggcac ctcgataacc ggctcgtggc 8820
tcgtacagac gattgtttgg ctctgtaact tgccagggac ggctgacgat gtgtttagtc 8880
tgccacttgc atccggcgct ttggttactc gggagactaa tggggggtgt ggtatggcac 8940
aggctggggg tgagtctggg gatgtccctg ggcgttgctg cagcccattc gccctctggg 9000
gatgagatgt tcaggggtgg ccggtaccct acgctgccga tttacataat ataaattgta 9060
aatgctgcag tagtagggat ctggacgcgc gacctgctac tcttcggaaa cgccaaccca 9120
ggagcgtcgc ctctggcccc atactcccgc catgcgactg ctcgccccct cccaggcctc 9180
cctggtgagc ccttgccgct ccccgcattc ctgctttcgg cgcccctgcg gatcccgatg 9240
acagcaggcc tttccttccc ccgttaatga aaagaatgac agtgaggttg tgacagaagg 9300
acagctttat tcagtttaca gagtgccctc ggaggctacg atattcccgt taaatgtctt 9360
gttgattctc tcaaaggtgg ggagggagga gctctccaca acaatgttcc ctggcagcgt 9420
gagcgcgcag ccctgccgtt ggatgtatct tctcatgatg gtgctgatag aggggtctcc 9480
ggcgtagatg aaaaaggcct gggccatgct ctggccggtc acgatcgtta tggggttgtt 9540
ggaaatgttc cggaccgtca gcttgagggt ctggcccggc ttccactcct gtgggtagac 9600
gtagaagacc gggttggagg agtgggacac gacaacggcc gtaatcttgg agctcagggg 9660
ggcctcgtag gtgttgttgt attccagctc cgtgatgaaa ttaggaggaa taatcacagg 9720
ggagccaaag tagcggatgt ctgtggattc cccgtcccag cgccagtggc tcttagggta 9780
ggggttgtaa cggaaggcaa taatcacatc atccaatagg gtcatgccca ccttgacgtt 9840
cagcgggccc tctcgtttca ggtccggcgt gtccacggag actcggacgt agcccttacc 9900
gcggcgtatg gcgtttaccg gacacacctt ccccgggaat gtgtgaatac gggcgtatga 9960
ctttagaaat gggggcgtgt gctgcgccag caggtaaggc aggcactcgt cctggctggt 10020
gacgggagag ccactgagga agatctgggg ctcgctggtg tttagcttgt ccccgctctg 10080
ggtgcaggag cgtgtcagct gaatgtcgct ctgcccgggc agaatctgca ggtagaggta 10140
ggggttcttg accaatctga tgggcacaat gtaccaggta aacttccctt tctctatgaa 10200
caggctgcgc ggattcagga cgcttagcac gatgtcctgg tcagagtgca taacgaagaa 10260
gggcttgagg aatacctcgt tgtcttccgc tccaaagaac aaaaacgcga ccgtaaagta 10320
gcggctgccg taggtggtcg tgttgaagga gaaagaaggt aacttgaagc tgagtatctg 10380
gcccaccgag gggcagggag gcagctcttg gcactgcgcg tccagctgca atacctgctt 10440
gttggtgacg cggacgtatg aggggaagat ctcgtacttc cacacgcctc tcatgaacga 10500
cgtgtctggt ttttcagtgg gccgcaggcg gcggaggctg ttcctgaacg acgagcgccg 10560
ggacgctagt gctgcatggg ctcctccggg gtaagcttcg gccatggccg gagctcgtcg 10620
acgggcaagg tgagagtcgg ggggcgggcg acggtgcggc cccaatacaa ctctccgctc 10680
gttagctggt agaatatccg cccggcgtct aggttgtcac ttcgctcggc cggccagaag 10740
agcgcaagtc caagtctggt gctggggccg atgtgcagcg gtttgtgccc gcagttgtag 10800
actgtcattt ttatgggcga gtgggcggtc cacacgcgcg ggcgcagcac ccattggtcg 10860
cacgccgcct cctggaatgt aaacccccag agagagggcg tgccgccctg gagatggccc 10920
tgtgccatca catgtatttc ctccttgggt ggaacaacgg cgtcgtgctc cgggtggagg 10980
gggaatagcg tccaggcatc tttcagggtc acgagaccgg ggtccatgct cagagaacag 11040
ccctcccggg cggtgggcgg cccgggctcc agcagaacgt cgcagaccca gccctcctcg 11100
gccctgtcca cctgtatgtc caggtgcacg gacccggagg ctgcgtctcg tgacatggcc 11160
aggcctggtg ccagccgacc acgtcccgtg tcccagccga ggccgcgcca gagcagagcc 11220
cgggactgac tcagggccac atcccctcgg cccgcggacg ccgcctcgcc agcccccggg 11280
ccttcatggg cccgctttct acctctctcc ggcaccccag cctggtcagc cgcagaggaa 11340
gcatgacctt ggggtgggac ggggcaggcg tgatcctggg cgcaatcttt gccgatcccc 11400
acaccttcac tccttgttag gttgatagaa tgtcggtacc acgccacggg gggcgggccc 11460
gcatagggaa aagccaggga gagcgatgtg ggcgaggatg ggctcaggcg gccccagaca 11520
cgcaatttgc ccccctgggc ggccgcagcc tgcccctcgg cggcccgtgc cccagctccg 11580
tcacgggggg cgcataggag gggtatatct aggatagccg cacctacaca aatgagacac 11640
agacacaggt cgtgaggatt taggcaacgc aggcttgtct ttatagttac aaacatggga 11700
gcgtgcacct ggaagatgca gctggggtag atctttacat ctttacaggg cgcagcggcc 11760
gccagacact gaagggcaga gttcacggcg ggcacctccc agagggagcc caccagcccg 11820
tacctggcca cggcc 11835



<210> 31 
<211> 2337 
<212> DNA 

<213> Plasmodium falciparum 
<400> 31 

aaaaaagaaa attataaata aatatatata ttcgtgtaaa aataagtaga aaccacgtat 60
attataaatt acaattcatg atgagaaaat tagctatttt atctgtttct tcctttttat 120
ttgttgaggc cttattccag gaataccagt gctatggaag ttcgtcaaac acaagggttc 180
taaatgaatt aaattatgat aatgcaggca ctaatttata taatgaatta gaaatgaatt 240
attatgggaa acaggaaaat tggtatagtc ttaaaaaaaa tagtagatca cttggagaaa 300
atgatgatgg aaataataat aatggagata atggtcgtga aggtaaagat gaagataaaa 360
gagatggaaa taacgaagac aacgagaaat taaggaaacc aaaacataaa aaattaaagc 420
aaccagggga tggtaatcct gatccaaatg caaacccaaa tgtagatccc aatgccaacc 480
caaatgtaga tccaaatgca aacccaaatg tagatccaaa tgcaaaccca aatgcaaacc 540
caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca aatgcaaacc 600
caaatgcaaa cccaaatgca aaccccaatg caaatcctaa tgcaaatcct aatgcaaacc 660
caaatgcaaa tcctaatgca aacccaaatg caaacccaaa cgtagatcct aatgcaaatc 720
caaatgcaaa cccaaatgca aacccaaacg caaaccccaa tgcaaatcct aatgcaaacc 780
ccaatgcaaa tcctaatgca aatcctaatg ccaatccaaa tgcaaatcca aatgcaaacc 840
caaacgcaaa ccccaatgca aatcctaatg ccaatccaaa tgcaaatcca aatgcaaacc 900
caaatgcaaa cccaaatgca aaccccaatg caaatcctaa taaaaacaat caaggtaatg 960
gacaaggtca caatatgcca aatgacccaa accgaaatgt agatgaaaat gctaatgcca 1020
acaatgctgt aaaaaataat aataacgaag aaccaagtga taagcacata gaacaatatt 1080
taaagaaaat aaaaaattct atttcaactg aatggtcccc atgtagtgta acttgtggaa 1140
atggtattca agttagaata aagcctggct ctgctaataa acctaaagac gaattagatt 1200
atgaaaatga tattgaaaaa aaaatttgta aaatggaaaa atgttccagt gtgtttaatg 1260
tcgtaaatag ttcaatagga ttaataatgg tattatcctt cttgttcctt aattagataa 1320
agaacacatc ttagtttgag ttgtacaata tttataaaaa tatatactac tttttttctt 1380
aattttcatt tttctttata ttttcctatt taatttattt ttttgtgaat atttaattac 1440
gtttgcgatt aattgtagaa atatatatgt atatactata tttatagaat gtgttattct 1500
caaaaacaac aacaaaaaaa aaaaaaaaaa aaaaaaaaag aaaaaaggat taaaagtaaa 1560
atagttataa atattttcaa aaatatttat aacacaaaaa atacttcgaa gttcatttaa 1620
catttttgtt tatttattta tttatatatt tcatttttac gtatttatat tataaaatgg 1680
tgtatcttaa aaatagtgaa ctatatatat aaaatattaa tttaaaaaaa ttataacttt 1740
ctttttattt tctaaaataa cttaaaaatt atatgtttaa gaaaggggta aattataata 1800
tttgtataaa tatataaaca tagatatatt aaataaaata acaaatgtac tatatttgtg 1860
cataagacgt atacgcttta tataatacaa caatattaat tgtaataata tttgtggtag 1920
tgtgaacact aaaattgata ataatgatta taatacagaa gaaataaaaa atgaatccaa 1980
tataggattt acaacaaata ttcatgaagc aaaaataatt caagaaaaga catatggatt 2040
aataataaac gataaaataa agaaagaaga atatgatgat tgtaataata ataataataa 2100
taatattata atacagataa gagaagttgg acttaattat tttggagata ctctcgatga 2160
atcgaatcca tgtaatgatc ttacaggtat taatatatgg gaaagttgtc ttgtggctag 2220
tcgatggttt agcgatttat ctttacagaa ttttttttcg aataaaaata ttttagaaat 2280
tggtgctggc agtggtttgg ctagtataat aatatttata tattctaata tttacaa 2337



<210> 32 
<211> 729 
<212> DNA 

<213> Plasmodium falciparum 



<400> 32 

cgaaaaatat ttaattatct aaataaattt aattaaaaat ttttataaca tattttattt 60
aagattttat aataattaag ttttaatttc ttttgatcca aagtttttaa taattaaatt 120
tgtagatttt taatttattt aatatattca aaatgaaaat cttatcagta ttttttcttg 180
ctcttttctt tatcattttc aataaagaat ccttagccga aaaaacaaac aaaggaactg 240
gaagtggtgt tagcagcaaa aaaaaaaata aaaaaggatc aggtgaacca ttaatagatg 300
tacacgattt aatatctgat atgatcaaaa aagaagaaga acttgttgaa gttaacaaaa 360
gaaaatccaa atataaactt gccacttcag tacttgcagg tttattaggt gtagtatcca 420
ccgtattatt aggaggtgtt ggtttagtat tatacaatac tgaaaaagga agacacccat 480
tcaaaatagg atcaagcgac ccagctgata atgctaaccc agatgctgat tctgaatcca 540
atggagaacc aaatgcagac ccacaagtta cagctcaaga tgttacacca gagcaaccac 600
aaggtgacga caacaacctc gtaagtggcc ctgaacacta aacagctgta aacttttttg 660
ttaatgggtt tttttgaaac acgtgaaaat aatttttatt tatgattata ttatatatat 720
tgctatttt 729



<210> 33 

<211> 594 

<212> DNA 

<213> Homo sapiens 



<400> 33 



atgggtgccc cgacgttgcc ccctgcctgg cagccctttc tcaaggacca ccgcatctct 60
acattcaaga actggccctt cttggagggc tgcgcctgca ccccggagcg gatggccgag 120
gctggcttca tccactgccc cactgagaac gagccagact tggcccagtg tttcttctgc 180
ttcaaggagc tggaaggctg ggagccagat gacgacccca tagaggaaca taaaaagcat 240
tcgtccggtt gcgctttcct ttctgtcaag aagcagtttg aagaattaac ccttggtgaa 300
tttttgaaac tggacagaga aagagccaag aacaaaattg agagagctct gttagcagaa 360
tgaaaaaatt ggaagccaga ttcagggagg gactggaagc aaaagaattt ctgttcgagg 420
aagagcctga tgtttgccag ggtctgttta actggacatg aagaggaagg ctctggactt 480
tcctccagga gtttcaggag aaaggcaaag gaaaccaaca ataagaagaa agaatttgag 540
gaaactgcgg agaaagtgcg ccgtgccatc gagcagctgg ctgccatgga ttga 594



<210> 34 

<211> 1605 

<212> DNA 

<213> Homo sapiens 

<400> 34 

gtggcggcgg cggcatgggt gccccgacgt tgccccctgc ctggcagccc tttctcaagg 60
accaccgcat ctctacattc aagaactggc ccttcttgga gggctgcgcc tgcaccccgg 120
agcggatggc cgaggctggc ttcatccact gccccactga gaacgagcca gacttggccc 180
agtgtttctt ctgcttcaag gagctggaag gctgggagcc agatgacgac cccatagagg 240
aacataaaaa gcattcgtcc ggttgcgctt tcctttctgt caagaagcag tttgaagaat 300
taacccttgg tgaatttttg aaactggaca gagaaagagc caagaacaaa attgcaaagg 360
aaaccaacaa taagaagaaa gaatttgagg aaactgcgaa gaaagtgcgc cgtgccatcg 420
agcagctggc tgccatggat tgaggcctct ggccggagct gcctggtccc agagtggctg 480
caccacttcc agggtttatt ccctggtgcc accagccttc ctgtgggccc cttagcaatg 540
tcttaggaaa ggagatcaac attttcaaat tagatgtttc aactgtgctc ctgttttgtc 600
ttgaaagtgg caccagaggt gcttctgcct gtgcagcggg tgctgctggt aacagtggct 660
gcttctctct ctctctctct tttttggggg ctcatttttg ctgttttgat tcccgggctt 720
accaggtgag aagtgaggga ggaagaaggc agtgtccctt ttgctagagc tgacagcttt 780
gttcgcgtgg gcagagcctt ccacagtgaa tgtgtctgga cctcatgttg ttgaggctgt 840
cacagtcctg agtgtggact tggcaggtgc ctgttgaatc tgagctgcag gttccttatc 900
tgtcacacct gtgcctcctc agaggacagt ttttttgttg ttgtgttttt ttgttttttt 960
tttttggtag atgcatgact tgtgtgtgat gagagaatgg agacagagtc cctggctcct 1020
ctactgttta acaacatggc tttcttattt tgtttgaatt gttaattcac agaatagcac 1080
aaactacaat taaaactaag cacaaagcca ttctaagtca ttggggaaac ggggtgaact 1140
tcaggtggat gaggagacag aatagagtga taggaagcgt ctggcagata ctccttttgc 1200
cactgctgtg tgattagaca ggcccagtga gccgcggggc acatgctggc cgctcctccc 1260
tcagaaaaag gcagtggcct aaatcctttt taaatgactt ggctcgatgc tgtgggggac 1320
tggctgggct gctgcaggcc gtgtgtctgt cagcccaacc ttcacatctg tcacgttctc 1380
cacacggggg agagacgcag tccgcccagg tccccgcttt ctttggaggc agcagctccc 1440
gcagggctga agtctggcgt aagatgatgg atttgattcg ccctcctccc tgtcatagag 1500
ctgcagggtg gattgttaca gcttcgctgg aaacctctgg aggtcatctc ggctgttcct 1560
gagaaataaa aagcctgtca tttcaaacac aaaaaaaaaa aaaaa 1605



<210> 35 

<211> 3107 

<212> DNA 

<213> Homo sapiens 

<400> 35 

atgatacagt ccaagcacct ggatgatgag tatgagagca gcgaggagga gagagagact 60
cccgcggtcc cacccacctg gagagcatca cagccctcat tgacggtgcg ggctcagttg 120
gcccctcggc ccccgatggc cccgaggtcc cagataccct caaggcacgt actgtgcctg 180
cccccccgca acgtgaccct tctgcaggag agggcaaata agttggtgaa atacctgatg 240
attaaggact acaagaagat ccccatcaag cgcgcagaca tgctgaagga tgtcatcaga 300
gaatatgatg aacatttccc tgagatcatt gaacgagcaa cgtacaccct ggaaaaggtg 360
ggtgcaggat gggagcagct ctgtggggga agagcgggca tgggggtgcg gtgaccctgc 420
agcccctcaa ggcccagtct ctggagccat ctctcacctc tccgactctg agcttccact 480
gcactggcag tttgactcgt gcttcctgcc ctcggcttct gtctctcatg ctctctgagt 540
gtctcgccgt ctggccaggt gggtctcatc gcctctgcca gcgtcagctc ccacagcgaa 600
ggtcttccgt gtgctgtctt cttctgccct 
gcagttctaa cccggaatca ctgtctgccg 
acactctgtt ttctccccca cgtagaagtt 
agaacacctg tatattcttg tctgcacacg 
caaggacact cccaggctga gtctcctctt 
caaccgtgcc agcgaggctg tcctctggga 
gtatgattgg cctctccagc tcctcccctc 
gattgcaata gcctggtggt ctggcgcaag 
caaaggatct caccagggcg gatggggaag 
gctctgtggc cccagatgac atctaagaga 
cctacctggg gcgcccctga ccaacagtcc 
cagcgactcc ccgggggctg ggaagtcata 
tctgctcatg gattcctttc cttgccctca 
ggaagctcat cacagatgac tttgtgaagc 
ctctcactcg agcatccttt gtgtgctggt 
tcttcaactg gagggatggg tgcctctgac 
agtgtgtggg gagccgaagg ccagggcggt 
aacacggtgt ggggtgtgaa aacggccgcc 
cacaagagtt ccccctggtt tacacctaag 
gagcatggct cccagctggt aagggcctca 
ccaccctctc cccgtccctg gaggacttgc 
agggtcagga cttgaggagg atgcctggag 
ctccagtcgt ttcctctggt gggcctctcc 
cgaggtgggg tgtgctcaga gcagggggcc 
ccaacaggaa gctggggtca tcgtgatgag 
aatgccctac atgtggaccc cctgcacctc 
aaggctttga aggcctcact tttgcaggca 
gaagggctgg aggggcagga cagtcctctt 
gaggggcccc ctggaaccca tccgttcagc 
actttcctct catgccaagg tacctggaat 
agtatgaatt cctctggggc ctgcgagccc 
gattcatcgc ccagaatcag aaccgagacc 
ctgtggatga tgctttcaag acaatggatg 
agatgagggc ccagatgaat atcggggatg 
acatacaagt cgagctcctg acctgggatg 
ggatcccctt tgctttctgg gccagatacc 
acaggagggc cagcttcttc tcctggatcc 
cagcgatctt actggccaag ccagagcgcc 
ctaggcggct tcttcctgtc agtcggaggt 
cccgctttca ttttgtgctt ttccttgtgt 
aagttgcaaa attaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



cgctcacgag tttggattcc ttgctgagga 660 
gcaggatgcc cagcatgggg tttggatctc 72 0 
tgggatccac ctgaaggaga tcgacaagga 7 80 
ggactcctca gctcgcctcc ttggaaaaac 840 
ggtgattctg ggcgtcatct tcatgaatgg 900 
ggcactacgc aagatgggac tgcgccctgg 960 
ggtgctatcc tctggccaaa gaggtcctgg 1020 
ggcgtggggt gccctgggct cggtagagag 1080 
cggtgctgga cgctgctcag ccctctctct 1140 
gacagtcaga gtcagggatt ccatcaaatc 1200 
tctggcctct gctgcatgcc caggcctcca 1260 
gtcatgctag ggagggcccc tgccaccgtc 1320 
gggtgaggca cccattcctc ggcgatctga 1380 
agaagtaagt atcacctgag ctaactgcgg 1440 
ctggctgaga aagcagttcc ctatcccaaa 1500 
ctgggagtga gtggcagtgg ggggtatgcg 1560 
cttgggaaaa gggagctcac gtcacctgag 162 0 
atcaccttga gcacctgccc tgtagactga 1680 
gaacccggag ctcagagagg agacgcctct 1740 
gcccaactct cctgattttc aggccagggg 1800 
caacgcacag gcgcgcatgc acaccaacaa 1860 
cacgcttctc ctggctgact gtttcttcct 1920 
agggctccgc cggggtgtgg ccaagaccct 1980 
tgaagaatgg ctcctctgtt tacaacacac 2040 
gggcacaaac ttgtggcctc cctacagaca 2100 
cgcatggctt ccggggagga ccaatggcaa 2160 
gaagtcctgg gagtgggttt gggaatgagt 2220 
ccaggagctg agctgcggca tcgggttgag 2280 
aacaggtctg cttggctagc agcaaagttt 2 340 
acaagaagat ccccaacagc aacccacctg 2400 
gccatgagac cagcaagatg agggtcctga 2460 
cccgggaatg gaaggctcat ttcttggagg 2520 
tggatatggc cgaggaacat gccagggccc 2580 
aagcgctgat tggacggtgg agctgggatg 2 640 
aggacggaga ttttggcgat gcctgggcca 2700 
atcagtacat tctgaatagc aaccgtgcca 2760 
agtaagagtt tcggcaccgt tgacgaactg 2 82 0 
tcctctcaga ttccttctcg acacagcacc 2880 
ggcatgcaag atgaagctct ctttgctctt 2940 
tttcatgttt tgggtatcag tgttacatta 3000 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3060
aaaaaaaaaa aaaaaag 3107 



<210> 36 

<211> 2509 

<212> DNA 

<213> Homo sapiens 

<400> 36 

ccccactgtc tgccccgcag gctgtgcact 

gtcagccccc ggtgccacac gagattccaa 

cacaggcgct ggcctggcag gcccagaagg 

cccaggaggc ccagaggcag gctcccccct 

ccccgccctc ccaaaaagcc gtgcaaatcc 

gtccgcaagc ggaggtgccc acactgccgc 

tcttgcaggc ccagcccgga cccccggtag 

aatcattgat gactccatca ggagaatgca 

ctaaagagcg caggacctcc tcgaaggagc 

ttgctgccac cttctgtgct cccaaggcag 

cctggaaaaa cctgcctgcc acaccggaga 

ctacctccca gtttcagcct gcctctctga 

agaccccaaa gtcactgcca tatgctctgc 



gcccttccat catctggcag gcccccaaag 60 
cgtcaatgga attccaggag gtgcagcaga 120 
cccccactca catctggcag cccctgcctg 180 
tggtccagct ggagcagccc tttcagggag 240 
agctaccccc ccagcaggcc caggcatcgg 300
tccagccttc ctggcaggca ccgcctgcag 360 
cagcggcaaa ttttcccctg ggctccgcta 420 
gggcctcttc tatagaccgc aggggctcct 480 
gcagggcccc ttcaaaagac cgcatgatct 540 
tgtcagctgc gcgagcacac ctgccagctg 600 
cctttgctcc ctcctcaagt gtcttcccag 660 
atgcctttaa aggcccctct gctgcctcag 720 
aggatccctt tgcctgtgta gaggccctgc 780 



ctgcagttcc atgggtccca cagcccaata 
ccaccttcct gatggctaca gcagctgccc 
ccaagacctc cgtcgagccg ccacgccgct 
tggaagccca agaggacagc cgtggccaca 
ggccctggga gaatctaaat ctgagtgact 
gtgactggga gcacccaaac accccccgtg 
ccaggatcct gagtggctgg gaagggccca 
gcccgagcac ctccagggcc ctgggtctct 
ttgtgtctga ggtcgcaagt gtctctccgg 
tggaggcaca gcccttgtct cccttggatg 
tagtcaagga ccaagccaag gtgcctgtcc 
gagagtataa agatgagtgc ttagatatca 
cctttggtta tcaattgaaa gaaattgata 
agctgggcta ccatacaggg aatttggtgg 
ttctgatggt ggtcttgagc ctcatcttta 
tctttaattt tctgttcaag ttagggttgg 
atactaagaa gctcatcacc gaagtgtttg 
tcccttacac tgagcccgca gagtatgagt 
ccagcaagat gcttgtcctg aggtttttgg 
ggccattcca ttaccttgaa gcgctcgcag 
aacctgacac cggtgacagt gcccacggcc 
tgtagcagag atctcgctcc tgtgtttccc 
catttttgtt cctggtgttt gtgttccagt 
ggtttcgtat ctgccaaagc tttgtacatt 
tgttctgttc tgtattttgg catctgtgtt 
tgttataatt gttatgtttt ggtatcagct 
gctatccatc tcatttctgg tatagttcat 
ttcaaatgac tcatcagtaa aaactgtcta 
taaatatact ttcagcataa aaaaaaaaaa 



tgaatgcctc aaaggcatcg caggcagtgc 840 
cccaggcaac tgccaccact caagaggcct 900 
ccggcaaggc cacccggaag aagaagcatc 960 
cgctagcctt tcatgactgg cagggcccaa 1020 
gggaggtcca aagccctatc caggtctcgg 1080 
gcctgagtgg ttgggagggc cctagcacct 1140 
gcgcatcctg ggccctgagt gcctgggagg 1200 
ctgaaagccc agggagctct ctgcccgtag 1260
gatccagtgc cacccaggat aattccaagg 1320
agagggcaaa tgcgttggtg cagttcctct 1380 
agcgctcgga gatggtgaaa gtcatcctcc 1440 
tcaaccgtgc caacaataag ctggagtgtg 1500 
ccaaaaacca cgcctatatt atcatcaaca 1560 
catcctattt agacaggccc aagtttggcc 1620 
tgaaaggcaa ctgtgtcagg gaggatctga 1680 
atgtccggga gacaaacggt ctctttggaa 1740 
tcaggcagaa gtacctagag tacaggcgaa 1800 
tcctctgggg ccctcgagca ttcctggaaa 1860 
ccaagctcca taagaaagat ccacagagct 1920 
agtgtgagtg ggaagacaca gatgaggatg 1980 
ccaccagcag gccccctccc cgctaatagg 2040 
tggccagagg ccactgacag ggtgggggga 2100 
tccacgagtg taagtttgga ttttcaactt 2160 
ttttatgtgg tgttgatttc aatcggctac 2220 
tttaagtgag atctgtggtt ctctgttttg 2280 
ttgtgctggc tttgtgaaat gaattgagaa 2340 
gtagcattgt aatcggttgt tctttgaacg 2400 
cagagaagta aatatctata tctatatata 2460 
aaaaaaaaaa aaaaaaaaa 2509



<210> 37 

<211> 1782 

<212> DNA 

<213> Homo sapiens 

<400> 37 

agtgttgcaa ctgggcctgg catgtttcag 
gtgaagtctg aggttgcaaa aggactgtga 
gaaattctca gaaaacagga ccttgatgtg 
ccacatctcc tgcctttctg ctcactttcc 
cctcggggtc agaagagtaa gctccgtgct 
acccagggtc tcaaggttgc tcacgccact 
tctcctgttt taggggatac tcccacaagc 
cagggagctc cacccaccac cactgctgct 
ggtgccaaat gccaaggtga ggaaaatgca 
agctcagtca aagatcctgt agcctgggag 
aagtataaaa tgagagagcc cattatgaag 
tacaaggatc acttcactga gatcctcaat 
ggccttgatt tgaaggaaga caaccctagt 
aacctcacca atgatggaaa cctgagcaat 
atgcctctcc tgggtgtgat cttcttaaag 
aaattcatga atgtgttggg agcctatgat 
cgtaagttca tcacccaaga tctggtgcag 
aacagtgatc ccccacgcta tcaattccta 
aagatgaaag tcctcgagtt tttggccaag 
tcccattatg aagaggcttt gagagatgag 
agagccaggc gtcgcactac tgccacgact 
aggtcctccc accccatgtg agaactcagg 
tgccaacctt ttgaagtagt gagcagccaa 
tcctttgtgt tcctgttaaa cattagtatc 
tatttagagt tgggatctat gtctatgagc 
agctttaagc ataagagttt tgatattcta 



tgtggtgtcc agcagtgtct cccactcctt 60 
tcatatgaag atcatccagg agtacaactc 120 
agaggagcag gttcaggtaa acaaagggtg 180 
tgcctgtttt gcctgaccac agccatcatg 240 
cgtgagaaac gccgcaaggc gcgagaggag 300
gcagcagaga aagaggagtg cccctcctcc 360
tcccctgctg ctggcattcc ccagaagcct 420 
gcagctgtgt catgtaccga atctgacgaa 480 
agtttctccc aggccacaac atccactgag 540 
gcaggaatgc tgatgcactt cattctacgt 600 
gcagatatgc tgaaggttgt tgatgaaaag 660 
ggagcctctc gccgcttgga gctcgtcttt 720 
ggccacacct acaccctcgt cagtaagcta 780 
gattgggact ttcccaggaa tgggcttctg 840 
ggcaactctg ccaccgagga agagatctgg 900 
ggagaggagc acttaatcta tggggaaccc 960 
gaaaaatatc tgaagtacga gcaggtgccc 1020
tggggtccga gagcctatgc tgaaaccacc 1080 
atgaatggtg ccactccccg tgacttccca 1140 
gaagagagag cccaagtccg atccagtgtt 1200 
tttagagcgc gttctagagc cccattcagc 1260 
cagattgttc actttgtttt tgtggcaaga 1320 
gatatggcta gagagatcat catatatatc 1380 
tttcaagtgt ttttctttta atagaatgtt 1440 
gacatggatc acacatttat tggtgctgcc 1500 
tatttttcaa atccttgaat cttttttggg 1560 



ttgaagaaga agaaagcata gctttagaat 

ctcacacaac ataattggag tcttaaaata 

ttgttttctg cattcagttt tgtttttgta 

agccttttca agaatgcaga taaaataaat 



agagattttc tcagaaatgt gtgaaagaac 1620
gaggaagagt aagcaaagca tgtcaagttt 1680 
aaatccaaag atacatacct ggttgttttt 1740 
agtaataaat ta 1782



<210> 38 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 38 

Gly Gly Gly Gly Ser 
1 5 



<210> 39 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 39 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1 5 10 15



<210> 40 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 40 

acagacgtag cacagactat ggtattctcc agattaacag caggtattat gacggtagga 60 
catgataggc 70



<210> 41 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 



<400> 41 

atagtttagc ggcccgcgat atctcactaa cactcattcc tgttgaagct 50



